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from  their  autocorrelation  functions.  Simulations  showed  the  closed-form 
solution  to  be  very  sensitive  to  noise  compared  with  Iterative  Fourier  transform 
algorithms,  particularly  If  the  corners  of  the  object  are  dim.  The  uniqueness 
proof  Is  Important,  however,  because  It  Is  useful  for  predicting  support 
constraints  for  which  Iterative  reconstruction  Is  facilitated.  Several 
potential  constraints  for  use  In  reconstruction  algorithms  were  examined 
briefly,  but  support  and  nonnegativity  are  the  only  two  constraints  that  have 
been  extensively  exploited.  In  the  case  of  illumination  with  an  active  system, 
the  edges  of  the  Illumination  pattern  are  unavoidably  tapered  which  can  cause 
algorithm  convergence  problems;  however,  this  problem  was  largely  overcome  by 
modification  to  the  Iterative  transform  algorithm  using  an  expanding  mask. 
Successful  reconstructions  were  obtained  In  the  presence  of  large  amounts  of 
edge  tapering.  Image  quality  was  measured  as  a  function  of  both  edge  tapering 
and  noise.  Alternative  reconstruction  algorithms  were  studied,  Including 
various,  gradient  search  algorithms  (for  which  analytic  expressions  for  the 
gradient  of  the  error  metric  were  derived)  and  a  modeling  approach,  but  they 
have  not  yet  been  developed  to  the  point  where  they  out-perform  the  Iterative 
transform  algorithm.  A  comparison  of  deterministic  and  stochastic  approaches  to 
reconstruction  was  also  made,  A  laboratory  experiment  using  active  coherent 
Illumination  was  performed  In  which  a  rough  object  was  Illuminated  by  a  laser 
with  a  known  (triangular)  Illumination  pattern  and  the  far-fleld  (Fourier) 
Intensity  pattern  was  detected.  Using  the  Iterative  transform  algorithm 
employing  knowledge  of  the  support  of  the  Illumination  pattern,  an  Image  was 
reconstructed  from  the  Fourier  Intensity  data.  The  reconstructed  Image  compared 
favorably  with  a  conventional  "ground  truth"  Image,  giving  experimental 
confirmation  of  the  analytical  and  simulation  results.  Data  was  also  taken  In  a 
passive  Incoherent  experiment  In  which  an  object  was  Imaged  through  a  simulated 
segmented-aperture  telescope  for  which  the  segments  were  misaligned.  A  specific 
tactical  application  using  10.6/im  laser  Illumination  was  analyzed  and  a  system 
architecture  was  developed.  It  was  determined  that  the  reduced-tolerance 
Imaging  concept  would  be  useful  for  obtaining  finer  resolution  with  a  simpler 
system  and  with  less  Impact  on  the  platform  geometry  than  would  be  possible  by 
conventional  means. 

In  summary,  the  studies  performed  thus  far  Indicate  that  the  reduced- 
tolerance  Imaging  concept  Is  a  promising  approach  worthy  of  continued 
consideration  for  reducing  size,  weight,  complexity  and  cost  of  fine-resolution 
Imaging  systems. 
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SECTION  1 

INTRODUCTION  AND  OVERVIEW 


1 . 1  BACKGROUND 

In  many  Imaging  scenarios  that  require  fine  resolution  at  long 
ranges,  phase  errors  limit  the  achievable  resolution  and  prevent 
diffraction-limited  Imaging.  The  phase  errors  may  arise  from  a  variety 
of  sources,  Including  atmospheric  turbulence,  misaligned  or  aberrated 
optics,  motion  compensation  errors,  local  oscillator  errors,  and 
waveform  generator  errors.  The  conventional  approach  for  obtaining 
diffraction-limited  Imagery  Is  to  build  Increasingly  more  complex  sensor 
hardware  having  tight  tolerances  on  Its  various  components  to  achieve 
the  desired  phase  stability. 

An  alternative  approach  Is  to  build  hardware  having  reduced 
tolerances  on  Its  phase  stability,  and  correct  for  the  phase  errors  by 
employing  a  phase  retrieval  algorithm  In  a  post-processing  stage.  In 
some  Instances  a  sensor  can  be  used  that  Is  capable  of  measuring 
Intensity  only  and  does  not  measure  the  phase.  In  either  case  a  phase 
retrieval  algorithm  Is  used  to  retrieve  the  phase.  This  approach  Is 
called  Reduced  Tolerance  Imaging  (RTI).  Using  this  approach,  we  can 
potentially  achieve  diffraction-limited  Imaging  using  a  sensor  system 
that  Is  simpler,  cheaper,  lighter-weight  and  smaller  than  a  conventional 
sensor. 

In  order  for  a  phase  retrieval  algorithm  to  work,  It  Is  necessary  to 
have  some  form  of  a  priori  Information  about,  or  constraints  on,  the 
Image.  Examples  of  such  constraints  that  have  been  useful  In  the  past 
are  nonnegativity  (applicable  to  incoherent  Imaging)  and  knowledge  of 
the  object's  support  (knowing  Its  width  or  shape,  Information  which  Is 
available  for  objects  on  dark  backgrcunds  or  If  one  controls  the  pattern 
of  radiation  that  Illuminates  the  object). 
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Several  Important  Issues  must  be  addressed  to  make  the  RTI  concept 
feasible.  Constraints  must  be  found  that  are  powerful  enough  to  ensure 
that  the  retrieved  phase  and  the  reconstructed  Image  are  uniquely 
related  to  the  measured  data.  The  relationship  between  the 
reconstructed  Image  and  the  measured  data  must  be  robust  enough  that  It 
Is  not  overly  sensitive  to  noise  or  other  Imperfections  In  the  data  or 
constraints.  Reconstruction  algorithms  must  be  found  that  converge 
reliably  to  a  solution  with  a  reasonable  amount  of  computation  and  In 
the  presence  of  realistic  amounts  of  noise. 

This  report  describes  the  results  of  a  2-year  program  for  Initial 
development  of  the  Reduced  Tolerance  Imaging  concept. 

1.2  OVERVIEW  OF  ACCOMPLISHMENTS 

In  this  section  the  principal  results  of  the  RTI  program  will  be 
briefly  summarized.  They  are  reported  In  detail  In  the  sections  and 
appendices  that  follow. 

We  would  like  to  know  how  well  we  could  ever  hope  to  reconstruct  an 
Image  from  the  given  data  and  constraints.  Then  we  would  know  whether 
current  reconstruction  algorithms  are  good  enough  or  further  development 
Is  needed.  We  would  also  be  able  to  evaluate  and  compare  various 
measurement  schemes  without  having  to  develop  reconstruction  algorithms 
for  each.  This  can  be  done  using  estimation-theoretic  lower  bounds  on 
the  reconstruction  errors.  The  Cramer-Rao  lower  bound  was  derived  for 
the  case  of  far-fleld  Intensity  measurements  with  additive  Gaussian 
noise.  The  lower  bound  was  computed  and  compared  with  actual  errors 
experienced  In  Imagery  reconstructed  from  simulated  data.  These  results 
demonstrate  the  usefulness  of  estimation  theory  for  predicting  system 
performance.  Cramer-Rao  analysis  was  also  performed  for  a  second 
situation,  In  which  complex-valued  measurements  of  the  Fourier  transform 
(fields)  are  measured,  and  the  measurements  are  corrupted  both  by  phase 


errors  and  by  additive  noise.  Section  2  and  Appendices  A  and  B  describe 
these  results. 

For  discrete,  or  sampled,  objects  of  a  certain  type,  a  closed-form 
recursive  reconstruction  algorithm  was  developed.  It  reconstructs  an 
Image  from  the  autocorrelation  function  which  Is  obtained  by  Inverse 
Fourier  transforming  the  measured  Fourier  Intensity  data.  Unfortunately 
the  closed-form  reconstruction  algorithm  was  found  to  be  very  sensitive 
to  noise,  especially  if  the  corner  or  vertex  points  within  the  object 
are  dim  compared  with  Interior  points.  Although  the  closed-form 
reconstruction  algorithm  Is  of  little  practical  use  because  of  Its 
sensitivity  to  noise,  It  has  provided  valuable  Insights  Into  the 
reconstruction  problem.  It  constitutes  a  uniqueness  proof  for  the  class 
of  objects  for  which  It  Is  applicable  and  suggests  Illumination  pattern 
shapes  that  are  advantageous.  These  results  are  described  In  Section  3 
and  Appendices  C  and  D. 

Since  Image  reconstruction  with  degraded  Fourier  phase  or  no  Fourier 
phase  requires  a  priori  constraints  on  the  object,  It  Is  Imperative  that 
object  constraints  that  are  sufficiently  powerful  and  robust  be  found. 
The  vast  majority  of  the  work  to  date  has  concentrated  on  two 
constraints:  support,  or  shape  (which  occurs  naturally  for  Imaging 
satellites  and  may  be  forced  by  an  Illumination  pattern)  and 
nonnegativity  (which  occurs  naturally  for  most  passive  incoherent 
imaging  problems).  Issues  relating  to  these  and  other  potential 
constraints  are  discussed  In  Section  4. 

When  a  support  constraint  Is  imposed  by  using  an  active  Illumination 
pattern  at  the  target  to  achieve  the  desired  known  shape,  the  principal 
problem  is  diffraction  effects  at  the  edges  of  the  Illumination  pattern. 
This  makes  the  Illumination  pattern  fall  off  slowly  and  smoothly,  1 .e . , 
It  Is  tapered,  rather  than  falling  off  abruptly  as  would  be  preferred. 
Sldelobes  of  the  Illumination  pattern  also  prevent  It  from  going  exactly 
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to  zero.  We  have  found  that  reconstruction  Is  much  easier  when  there  Is 
little  or  no  tapering  of  the  Illumination  pattern.  Previous  versions  of 
the  Iterative  reconstruction  algorithm  were  unsuccessful  In 
reconstructing  complex-valued  Images  when  large  amounts  of  taper  were 
present.  Improved  versions  of  the  algorithm,  employing  an  "expanding 
mask,"  were  developed,  and  this  resulted  In  a  greatly  Improved  result. 
It  consists  of  using  an  unrealistically  small  support  constraint  for 
early  Iterations,  which  forces  the  energy  of  the  Image  to  be  better 
centered  within  the  true  support  constraint,  and  using  progressively 
larger  support  constraints  for  later  Iterations.  It  was  found  that 
Image  quality  degrades  as  the  amount  of  taper  and  of  noise  Increases. 
With  no  taper  present,  the  Image  quality  degrades  slowly  with  Increasing 
noise.  With  tapered  Illumination,  however,  the  Image  quality  degrades 
faster.  It  was  also  shown  that  the  present  algorithms  have  room  for 
further  Improvement  In  terms  of  achieving  better  quality  Imagery  In  the 
presence  of  taper  and  noise.  Section  5  defines  the  Improved  algorithm 
employing  the  expanding  mask,  and  shows  experimental  reconstruction 
results  with  different  types  of  Illumination  patterns  and  for  tapered 
Illumination  and  additive  noise. 

The  Iterative  algorithm  described  In  Section  5  Is  one  of  several 
possible  approaches  to  solving  the  phase  retrieval  problem.  Improved 
algorithms  are  sought  which  are  faster  and  more  robust.  One  family  of 
alternative  algorithms  are  the  gradient  search  algorithms.  They  consist 
of  defining  a  merit  function,  computing  the  gradient  of  the  merit 
function  with  respect  to  a  set  of  parameters,  and  searching  In  the 
parameter  space  for  a  minimum  of  the  merit  function  by  moving  In  the 
direction  of  the  negative  of  the  gradient  (the  global  minimum  of  the 
merit  function  defines  the  solution,  the  reconstructed  Image).  Merit 
functions  that  were  examined  Include  the  amount  by  which  the  modulus  of 
the  Fourier  transform  of  an  object  estimate  differs  from  the  measured 
Fourier  modulus  data  and  the  amount  by  which  an  output  Image  violates 
the  object-domain  constraints.  Parameter  spaces  that  were  Investigated 


Include  the  space  of  object  estimates  and  the  space  of  Fourier  phase 
estimates.  Closed-form  expressions  for  the  gradients  were  derived,  and 
It  was  shown  that  the  entire  gradients  can  be  efficiently  computed  using 
a  small  number  of  fast  Fourier  transforms.  Gradient  search  algorithms 
were  tested  on  the  computer  with  mixed  results.  They  show  promise  and 
should  be  developed  further.  These  results  are  described  In  Section  6 
and  Appendices  E,  F  and  G. 

Another  approach  to  solving  the  phase  retrieval  problem  Is  a 
modeling  approach.  The  complex  Fourier  transform  or  pieces  of  It  are 
modeled  by  a  parameterized  function.  The  measured  Fourier  modulus  Is 
least-squares  fitted  to  the  modulus  of  the  model  to  determine  the 
unknown  parameters.  Then  the  parameters  are  Inserted  Into  the  complex 
model  which  Is  evaluated  to  determine  the  phase.  Attempts  thus  far  to 
make  the  modeling  approach  work  were  unsuccessful.  It  Is  likely  that 
the  models  used  were  not  appropriate  to  the  complex  Fourier  transforms 
of  Interest,  Better  models  would  be  needed  before  further  work  on  this 
approach  should  be  pursued.  This  work  Is  discussed  In  Section  7. 

The  vast  majority  of  the  phase  retrieval  work  prior  to  the  current 
effort  revolved  around  analysis  and  computer  simulations,  since  the 
computer  simulations  Implicitly  assume  a  discrete  model  for  the  object, 
there  Is  a  danger  that  the  real,  continuous  world  might  behave 
differently.  For  this  and  other  reasons  It  Is  very  Important  to 
demonstrate  the  feasibility  of  phase  retrieval  using  real  data  collected 
In  the  laboratory  which  Include  the  Important  real-world  effects.  Two 
experiments  were  performed:  an  active  coherent  experiment  and  a  passive 
Incoherent  experiment.  In  the  active  coherent  experiment  the  target  was 
Illuminated  with  a  laser  beam  pattern  having  the  desired  Illumination 
shape.  A  lens  formed  the  far-fleld  (Fourier  transform)  at  a  plane  where 
the  Intensity  pattern  was  detected.  A  second  channel  Including  Imaging 
optics  was  used  to  form  a  "ground  truth"  Image.  The  far-fleld  detected 
Intensity  was  corrected  for  camera  response  and  square-rooted  to  form  an 


estimate  of  the  modulus  of  the  Fourier  transform  of  the  target  field. 
An  Image  was  reconstructed  from  this  data  using  knowledge  of  the  shape 
of  the  Illumination  pattern.  The  reconstructed  Image  compares  favorably 
with  the  object  and  the  ground-truth  Image.  This  result  demonstrates 
the  feasibility  of  the  phase  retrieval  approach,  at  least  In  a 
controlled,  laboratory  environment.  In  addition  a  passive  Incoherent 
experiment  was  set  up  and  data  was  taken.  An  object  was  Imaged  through 
a  simulated  segmented-aperture  telescope  for  which  the  segments  were 
misaligned.  However  it  was  not  possible  to  complete  the  passive 
experiment  under  the  current  funding.  Section  8  and  Appendix  H  describe 
the  experimental  efforts  and  results. 

A  specific  Imaging  application  and  architecture  was  analyzed.  An 
active  optical  system  using  a  10.6/im  laser  Illuminator  would  detect 
reflected  Intensities  over  a  conformal  detector  array  that  would  be  much 
larger  than  the  largest  aberration-free  telescope  aperture  that  could  be 
practically  mounted  on  an  airborne  platform.  Image  quality  and  size 
dictated  the  laser  power  necessary  for  tactically  useful  ranges,  and  the 
required  laser  power  was  within  the  present  state  of  the  art.  This  Is 
described  In  Section  9. 

For  coherent  Images  we  can  view  the  phase  retrieval  problem  In  a 
stochastic  framework  as  well  as  the  more  conventional  deterministic 
framework.  These  two  approaches  are  compared  In  Appendix  I. 

In  summary,  the  Idea  of  using  phase  retrieval  algorithms  with  Image- 
domain  constraints  for  reducing  the  tolerances  required  on  Imaging 
system  components  continues  to  look  promising.  Requirements  on  transmit 
power  and  Illumination  pattern  tapering  have  been  quantified  and  appear 
to  be  achievable  for  a  specific  10.6/jm  laser-illumination  system  that 
was  analyzed.  Laboratory  experimental  results,  although  preliminary, 
confirm  the  analytical  and  computer  results.  These  promising  results, 
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together  with  the  high  potential  payoff  of  reduced  size,  weight,  cost, 
and  complexity,  suggest  that  further  development  of  the  approach  should 
be  undertaken  for  specific  applications. 

1.3  Recommendations 

It  Is  recommended  that  the  reduced-tolerance  Imaging  concept  be 
further  developed  along  two  major  directions:  application  to  specific 
sensors  and  expanded  understanding  of  the  underlying  theoretical  Issues. 
In  this  section,  research  topics  relating  to  two  of  the  most  promising 
tactical  applications,  an  active  laser  system  and  a  synthetic  aperture 
radar,  will  be  described,  and  the  Issues  relating  to  the  underlying 
theory  will  be  discussed. 

The  use  of  short-wavelength  active  Imaging  systems  using  laser 
Illumination  and  a  conformal  detector  array  served  as  the  dominant  focus 
for  the  research  reported  here  and  Is  the  best  understood  of  the 
potential  tactical  Imaging  scenarios  utilizing  phase  retrieval.  For 
this  application  the  most  Important  next  step  Is  to  expand  upon  the 
preliminary  laboratory  experimental  results  reported  In  Section  8  by 
simulating  in  the  laboratory  the  many  real-world  effects  that  are 
described  In  Section  8.1.1.  Subsequently  the  technique  should  be 
demonstrated  at  longer  ranges  in  rooftop-based  outdoor  test-range 
experiments  leading  toward  an  airborne  demonstration.  Further 
development  of  reconstruction  algorithms  for  Improved  speed  and 
robustness  Is  also  required,  particularly  for  the  case  of  larger  amounts 
of  tapering  of  the  edges  of  the  Illumination  pattern. 

The  second  application  of  the  reduced-tolerance  Imaging  concept 
which  should  be  vigorously  pursued  Is  synthetic  aperture  radar  (SAR), 
particularly  as  a  means  for  overcoming  the  lack  of  adequate  motion 
compensation.  Examples  of  specific  applications  Include  rapidly 
manoeuverlng  airborne  platforms  and  light-weight  platforms  lacking 
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Inertial  navigation  sensors.  The  SAR  applications  differ  from  the 
laser-illumination  systems,  emphasized  In  the  research  reported  here,  In 
some  Important  ways.  For  SAR  we  cannot  as  easily  obtain  a  useful 
Illumination-pattern  support  constraint,  but,  on  the  other  hand,  we  can 
achieve  similar  effects  by  using  no-return  areas  such  as  lakes,  shadows, 
and  smooth  runways  and  roads.  The  no-return  areas  In  SAR  Imagery  will 
occupy  a  much  smaller  fraction  of  the  total  area  of  the  Image  than  the 
non-lllumlnated  areas  In  the  case  of  laser-illuminated  Imagery.  This 
makes  Image  reconstruction  more  difficult  for  the  case  of  SAR.  However, 
for  SAR  we  measure  the  phase  which,  although  It  Is  distorted  by  motion 
compensation  errors,  can  be  used  to  start  the  reconstruction  processing' 
with  a  blurred  Image,  whereas  for  the  laser  Illumination  case  It  was 
assumed  that  there  was  no  phase  Information  at  all.  Furthermore,  the 
SAR  phase  errors  are  primarily  one-dimensional  (along  track)  and  so  the 
number  of  unknowns  Is  far  less  than  for  the  laser  Illumination  case. 
These  factors  should  make  Image  reconstruction  much  easier  for  the  case 
of  SAR.  This  combination  of  less  constraints  but  more  available  phase 
Information  makes  it  difficult  to  predict  how  well  the  reconstruction 
process  will  work  for  a  realistic  SAR  scenario  based  on  the  Initial 
results  shown  In  this  report.  Therefore  the  reduced-tolerance  Imaging 
technique  should  be  developed  specifically  for  the  types  of  constraints 
and  data  available  for  SAR  scenarios  of  Interest.  This  would  Include 
algorithm  development  that  would  make  use  of  no-return  areas  Imbedded  In 
Images,  use  partial  phase  Information  that  Is  measured,  and  take 
advantage  of  the  one-dimensional  nature  of  the  phase  error.  This  should 
first  be  done  for  computer-simulated  data  and  constraints,  and  later  for 
SAR  data  collections  available  at  ERIM.  An  assessment  of  the  types, 
amounts,  and  quality  of  constraints  typically  available  In  SAR  Images  of 
Interest  would  be  Included  in  the  study.  Both  analytically  and  by 
computer  simulation,  a  prediction  of  Image  quality  as  a  function  of 
constraints  and  data  properties  should  be  made.  Finally,  data 
collection  experiments  should  be  defined  and  carried  out  to  demonstrate 
the  concept  In  the  field. 


Underlying  the  reduced-tolerance  Imaging  concept  are  basic  phase 
retrieval  algorithms  and  uniqueness  questions  that  have  been 
Investigated  by  several  groups  of  researchers  around  the  world  for  over 
a  decade.  The  most  successful  algorithm  to  date  Is  the  iterative 
transform  algorithm  developed  at  ERIM  that  was  used  In  the  simulation 
experiments  reported  In  Sections  2.5,  4.1,  and  5.  Sections  3,  6  and  7 
describe  attempts  to  Improve  on  that  algorithm.  Further  work  along 
these  lines  Is  necessary  to  .*r*r'e  at  algorithms  with  Improved  speed  and 
Image  quality.  The  gradient  search  algorithms  described  In  Section  6 
appear  to  be  promising  and  should  be  developed  further.  Newton-Raphson 
and  other  approaches  should  also  be  explored.  Continued  optimization  of 
and  Improved  variations  on  the  Iterative  transform  algorithm  should  be 
pursued,  with  particular  attention  paid  to  characterizing  modes  of 
stagnation  and  developing  techniques  for  overcoming  them. 

The  uniqueness  question  can  be  stated  as  follows \  with  what 
confidence  can  we  say  that  a  reconstructed  Image  does  not  differ 
substantially  from  a  true  Image  of  the  object?  As  reported  In  Section 
3,  It  was  proved  that  certain  Illumination  patterns  guarantee  a  unique 
solution  for  the  case  of  a  sampled  model  of  the  object?  but  It  Is  not 
clear  how  this  carries  over  to  the  real  world  of  continuous  objects. 
The  answer  to  this  question  may  come  from  the  abstract  mathematical 
field  of  analytic  functions  of  several  complex  variables.  However, 
despite  efforts  by  several  workers  In  the  US  and  England,  a 
comprehensive  answer  to  the  uniqueness  question  remains  to  be  developed. 
Additional  questions,  such  as  how  Illumination  pattern  tapering  affects 
the  uniqueness  of  the  reconstructed  Image  should  also  be  addressed.  To 
answer  these  questions,  a  multifaceted  approach  should  be  pursued, 
Including  use  of  the  theory  of  analytic  functions  of  several  complex 
variables,  polynomial  factorization  theory,  and  brute-force  computer 
searches  through  the  solution  space  for  small  Images. 


With  this  combination  of  Investigations  Into  specific  applications 
and  underlying  theory,  the  reduced  tolerance  Imaging  concept  can  be 
developed  to  the  point  of  providing  a  near-term  solution  to  pressing 
sensor-system  problems. 
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SECTION  2 

INFORMATION  THEORETIC  LOWER  BOUNDS 
FOR  PHASE  RETRIEVAL 


2.1  INTRODUCTION 

In  phase  retrieval  problems,  It  Is  desired  to  estimate  the  phase  of 
the  Fourier  transform  of  an  object  given  measurements  of  the  Intensity 
(l.e.,  squared  modulus)  of  the  Fourier  transform.  This  Is  equivalent  to 
estimating  the  object  Itself  because  of  the  Fourier  transform 
relationship.  The  Iterative  Fourier  transform  algorithm  has  had  great 
success  in  making  such  object  estimates  from  Fourier  Intensity  data  and 
object  constraint  Information  [2.1,  2.2].  However,  other  than  through 
empirical  results  [2.3],  It  has  not  been  known  how  the  error  In  the 
object  estimate  depends  on  measurement  noise,  constraint  Information, 
and  other  parameters  describing  the  problem. 

Results  In  estimation  theory  Include  a  number  of  methods  whereby 
lower  bounds  on  the  mean-squared  error  of  the  object  estimate  may  be 
calculated.  These  methods  use  knowledge  of  the  measurement  procedure, 
the  statistics  of  the  object,  and  the  statistics  of  the  noise  process  to 
compute  an  error  lower  bound.  An  Important  feature  Is  that  these 
methods  do  not  require  specification  of  the  algorithm  used  to  compute 
the  object  estimate  from  the  measured  data.  The  lower  bound,  then,  Is 
Independent  of  the  algorithm  and  therefore  Indicative  of  the  best 
possible  estimation  performance  given  the  chosen  measurements  and  the 
underlying  statistics. 

The  Cramer-Rao  lower  bound  Is  the  most  often  used  lower  bound 
because  It  Is  usually  the  least  difficult  to  compute.  It  has  been  used 
in  a  large  number  of  single  and  multiple  parameter  and  time-varying 
waveform  estimation  problems  with  great  success  [2.4],  Algorithms  exist 
which  produce  estimates  that  achieve  the  Cramer-Rao  bound  In  problems  In 


which  the  measurements  are  linearly  related  to  the  parameters  to  be 
estimated,  the  noise  Is  additive,  and  the  statistics  are  Gaussian.  In 
nonlinear  problems  (of  which  phase  retrieval  will  be  an  example) ,  the 
lower  bound  can  usually  be  approached  only  at  high  signal -to-nolse 
ratios  [2.4,  2.5];  nonetheless,  the  lower  bound  Is  generally  regarded  as 
an  Important  first  step  In  evaluating  and  designing  measurement 
procedures  and  parameter  estimation  algorithms  for  these  problems. 

A  specific  phase  retrieval  problem  Is  defined  In  Section  2.2.  In 
Section  2.3,  the  Cramer-Rao  method  for  computing  the  error  lower  bound 
Is  reviewed  and  general  notation  Is  developed.  In  Section  2.4,  an 
explicit  expression  for  the  phase  retrieval  error  lower  bound  Is 
derived.  In  Section  2.5,  a  comparison  Is  given  of  the  lower  bound  to 
errors  found  In  computer  simulations  using  an  Iterative  Fourier 
transform  phase  retrieval  algorithm.  A  second  phase  retrieval  problem 
Is  defined  In  Section  2.6  and  an  expression  for  the  lower  bound  given. 
Conclusions  and  suggestions  for  further  research  are  given  in  Section 
2.7.  A  tutorial  review  of  the  Cramer-Rao  method  Is  presented  In 
Appendix  A.  A  method  for  determining  Cramer-Rao  lower  bounds  In  a  wider 
class  of  phase  retrieval  problems  and  the  specific  result  given  In 
Section  2.6  are  derived  In  Appendix  B. 

2.2  PHASE  RETRIEVAL  PROBLEM  DEFINITION 

From  the  many  combinations  of  possible  phase  retrieval  problems  and 
underlying  assumptions,  the  following  specific  example  Is  chosen.  It  is 
desired  to  estimate  a  two-dimensional,  complex-valued  object  fm  from 
real -valued  measurements  Sp  where  m  ■  (mi,  mg);  mi,  m2  *  0,  1,  ...,  M-l 
and  p  »  (pi ,  pg);  pi ,  P2  ■  0,  1,  ...,  2M-1.  The  measurements  are  re¬ 
lated  to  the  object  by 


and 


(2-2) 


where  Ip  is  the  Intensity  (squared  modulus)  of  the  discrete  Fourier 
transform  of  f #  c  Is  a  proportionality  constant,  Np  Is  additive  noise, 
<m,  p>  -  mipi  +  m2P2/  and  summation  over  m  Implies  the  double  summation 
over  mi  and  m2.  Object  constraint  information  Is  essential  for 
estimating  the  object.  The  weighting  array  wra  Is  explicitly  Included  In 
Eq.  (2-2)  to  allow  arbitrary  support  constraints  to  be  placed  on  the 
object.  For  an  object  of  M  by  M  resolution  elements,  Nyqulst  sampling 
requires  a  measurement  array  of  size  2M  by  2M  because  the  squared 
modulus  has  twice  the  bandwidth  of  the  complex-valued  Fourier  transform. 
It  will  be  convenient  later  to  consider  w,  f,  S,  I,  and  N  as  vectors. 
The  phase  retrieval  problem  Is  to  estimate  the  object  f  given  the  set  of 
measurements  S  and  knowledge  of  the  constraint  that  the  product  wmfm  Is 
zero  wherever  wm  Is  known  to  be  zero. 

This  mathematical  statement  can  represent  a  number  of  applications 
In  which  phase  retrieval  problems  arise.  For  example,  consider  the 
measurement  geometry  shown  In  Figure  2-1.  A  known,  complex-valued, 
monochromatic  wavefront  w(x,  y)  Illuminates  an  unknown,  complex-valued 
object  f(x,  y).  Alternatively,  for  the  wavefront  sensing  problem,  an 
unknown  monochromatic  wavefront  f(x,  y)  may  pass  through  a  known  aper¬ 
ture  having  known  complex  transmittance  w(x,  y).  The  Intensity  I (u,  v) 
In  a  measurement  plane  located  a  distance  R  from  the  object  plane  is 
[2.6]: 


FIGURE  2-1.  MEASUREMENT  GEOMETRY  FOR  PHASE  RETRIEVAL  PROBLEM. 


where  X  Is  the  wavelength  and  It  Is  assumed  that  R  is  sufficiently  great 
that  the  Fraunhofer  approximation  can  be  made.  A  discrete  set  of 
measurements  S  Is  made  with 

Sp  ■  jjT  J  I(u,  v)  du  dv  +  Np  (2-4) 
AA 

where  rj  Is  the  detector  efficiency,  T  is  the  detector  Integration  time, 
AA  Is  *he  area  of  a  detector  element,  Np  Is  the  detector  noise,  and  the 
subscript  p  «  (pi,  P2)  Indexes  over  the  measurement  plane.  A  phase  re¬ 
trieval  method  (e.g.,  an  Iterative  Fourier  transform  algorithm)  would  be 
applied  to  the  measurement  set  S  using  the  object  constraint  provided  by 
the  Illumination  pattern  w  to  give  an  estimate  of  a  sampled  version  of 
the  object  f.  Conversion  of  Eqs,  (2-3)  and  (2-4)  into  discrete  form 
gives,  for  this  application,  a  value  for  the  constant  c  In  Eq.  (2-1)  of 
»;TAA(Aa/XR)2  where  Aa  Is  the  square  area  associated  with  an  object 
sample. 

The  complex-valued  object  f  can  be  written  In  terms  of  its  real  and 
Imaginary  parts, 


f 


fr  + 
m 


If, 


m 


(2-5) 


Equation  (2-2)  then  becomes 


2.3  CRAMER-RAO  LOWER  BOUND 


It  can  be  proven  that  the  variance  of  any  unbiased  estimate  of  a 
component  of  a  random  vector  Is  greater  than  or  equal  to  the 
corresponding  diagonal  element  of  the  Inverse  of  what  Is  called  the 
Fisher  Information  matrix.  The  value  of  the  diagonal  element  Is  the 
Cramer-Rao  lower  bound.  The  elements  of  the  Fisher  Information  matrix 
depend  In  turn  upon  the  second  partial  derivatives  of  the  joint 
probability  distribution  of  the  measurement  vector  and  the  vector  to  be 
estimated.  This  result  Is  proven  primarily  by  the  use  of  the  Schwarz 
Inequality  [2.4]. 

Application  of  the  Cramer-Rao  method  for  determining  lower  bounds 
on  estimation  errors  to  a  specific  problem  must  therefore  begin  with  a 
determination  of  the  statistics  of  the  parameters  to  be  estimated  and  of 
the  noise  [2.4,  2.7].  In  this  analysis,  It  Is  assumed  that  fmr,  fml , 
and  Np  are  each  statistically  Independent,  zero  mean,  Gaussian  random 
variables  with  variances  (<rf)2/2,  (crf)2/2,  and  (c^)2  respectively.  Note 
that  this  Implies  that  fm  is  zero  mean  and  that  the  variance  of  fm 

(1 . e . ,  the  expected  value  of  lfml2)  Is  (<7f)2.  Since  the  Cramer-Rao 

method  can  not  be  directly  applied  to  the  complex-valued  random 
variables  fm,  this  section  develops  a  notation  for  applying  the  method 
to  the  real -valued  f|/  and  fm1 , 

By  the  definition  of  conditional  probability, 

p(S,  f)  *  p(SI f)p(f)  (2-7) 

where  p (S ,  f)  Is  the  joint  probability  density  of  S  and  f,  p(Slf)  Is  the 
conditional  probability  density  of  S  given  f,  and  p(f)  Is  the 

probability  density  of  f.  (Recall  that  f  and  S  are  vectors.)  The 

assumption  of  Gaussian  statistics  gives 
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p(f>  ■  TT  A 


ID  T 


exp 


<g  +  <g 


(2-8) 


and,  using  Eqs.  (2-1)  and  (2-6)  which  Imply  that  p(Slf)  ■  p(N  ■  S  -  cl), 


p(Slf)  -  ]~[  — exp 
P  *Nf2lr 


-  cl£ 


2  a 


N 


(2-9) 


where,  by  Eq.  (2-6),  1  Is  a  function  of  f. 

The  Cramer-Rao  method  continues  by  defining  the  Fisher  Information 
matrix  J  In  terms  of  the  probability  density  functions.  For  the  present 
problem,  where  It  Is  desired  to  estimate  the  statistically  Independent 
real  and  Imaginary  parts  of  f,  a  workable  notation  Is  to  partition  J 
Into  four  submatrices: 


(2-10) 


[jrr 

Jrtl 

Ltr 

J11] 

J  Is  of  dimension  2M2  by  2M2  (representing  the  M2  Independent  fmr  plus 
the  M2  independent  fm'1)  and  each  of  the  subrnatrlces  is  of  dimension  M2 
by  M2.  The  elements  of  the  submatrices  are  defined  by,  for  example 
[2.4,  2.7], 
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(2-11) 
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where  E[»]  denotes  expectation  taken  over  both  f  and  N  and  the  partial 
derivative  holds  S  constant.  The  other  submatrices  are  defined  by 
appropriate  substitution  of  the  superscripts  r  and  1.  It  Is  assumed 
that  these  and  any  other  required  derivatives  exist.  This  assumption  Is 
valid  for  the  phase  retrieval  problem. 

The  Cramer-Rao  method  concludes  by  determining  the  Inverse  J-1  of 
the  Fisher  Information  matrix  J.  The  diagonal  elements  of  J“1  are  the 
desired  lower  bound  on  the  mean-squared  error  of  the  object  estimate  ?. 
From  the  convention  used  to  define  J,  the  upper  left  diagonal  elements 
of  J*1  refer  to  fmr  and  the  lower  right  elements  to  fm1.  If  J"1  Is 
similarly  partitioned  Into  four  submatrices: 


1  „ 
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Kr,l 
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(2-12) 


then  the  Cramer-Rao  lower  bound  (eQm)2  on  the  mean-squared  error, 

E [ I fm  “  fml2]i  the  estimate  of  object  component  fm,  Is  the  sum  of 
the  diagonal  elements  for  f]nr  and  fm^ : 


«2  „rr  ,  »11 
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(2-13) 


This  Is  the  quantity  which  the  following  analysis  seeks.  Strictly,  the 
lower  bound  Is  only  for  asymptotically  unbiased  estimates  of  f.  It  Is 
beyond  the  scope  of  this  work  to  determine  whether  particular  phase 
retrieval  algorithms  give  unbiased  estimates. 

2.4  LOWER  BOUND  FOR  PHASE  RETRIEVAL 

Substituting  Eqs.  (2-8)  and  (2-9)  Into  Eq.  (2-11),  differentiating, 
and  discarding  a  term  with  zero  expected  value  gives  [2.4] 
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(2-14) 
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where  ffmn  Is  the  Kronecker  delta  function.  Similar  results  hold  for  the 
other  submatrices  of  J  except  that  and  J1*'  have  no  $mn  term.  It  Is 
Important  to  note  that  this  result  holds  for  any  function  I  of  the 
parameter  f.  It  does  not  assume  that  the  measurements  are  of  the 
Fourier  Intensity. 


Equation  (2-6)  can  now  be  used  to  compute  the  first  term  on  the 
right  hand  side  of  Eq.  (2-14).  First, 

jj?  ■  *C  £  wj(fj +  ifj)  e*p  [ p>] +  (z-i5> 
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(2-16) 


Taking  the  expected  value,  the  summation  over  k  reduces  to  those  terms 
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for  which  j  ■  k  since  the  fn,r  and  fmi  are  Independent.  The  first  and 
third  terms  in  Eq.  (2-16)  are  also  eliminated  because  the  fmr  and  fm1 
have  equal  variances.  Thus, 
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(2-17) 


Finally,  the  summation  over  p  gives 
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(2-18) 


because 


E  «xp  [-«"  ^  »■  P>]  ■  4M\„  .  (2-19) 

Equation  (2-18)  Is  a  general  expression  for  one  of  the  submatrices 
of  the  Fisher  Information  matrix  J  given  the  assumptions  above.  Similar 
computations  show  that  J11  ■  0rr  and  ■  J1r  ■  0.  In  this  case,  then, 
J  is  diagonal  and  can  be  analytically  Inverted  to  obtain  J"1.  This  Is, 
of  course,  a  result  of  the  discrete  Fourier  transform  nature  of  Eq.  (2- 
6).  Other  phase  retrieval  problems  may  lead  to  nondiagonal  J  matrices 
which  may  be  difficult  or  impractical  to  Invert  analytically. 

Using  Eqs.  (2-13)  and  (2-18),  the  lower  bound  (eom)2  on  the  mean- 
squared  error  In  the  estimate  of  fm  Is: 


(2-20) 
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It  Is,  as  stated  earlier,  Independent  of  the  phase  retrieval  algorithm 
used  to  estimate  f. 

The  notation  of  Eq.  (2-20)  can  be  simplified  by  defining  a  slgnal- 
to-nolse  ratio: 


SNR  - 


{£[cg)z 


where,  by  Eq.  (2-6), 


Equation  (2-20)  then  becomes 
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(2-22) 


(2-23) 


As  would  be  expected,  the  lower  bound  on  the  estimate  reduces  to  the  a 
priori  variance  (erf)2  If  either  fm  Is  not  Illuminated  (wm  =  0)  or  the 
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SNR  Is  zero.  The  lower  bound  also  approaches  zero  as  the  SNR  approaches 
Infinity. 

For  the  case  In  which  the  magnitudes  of  the  support  constraint  w 
are  either  zero  or  one,  Eq.  (2-20)  predicts  that,  If  the  support 
constraint  Includes  a  smaller  part  of  the  M  by  M  object  array  (and 
therefore  Ej I wj 12  decreases),  then  the  error  lower  bound  Increases. 
This  Is  due  to  the  loss  of  signal  as  can  be  seen  from  Eq.  (2-22).  On 
the  other  hand,  If  the  SNR  (In  the  measurement  plane)  Is  held  constant, 
then  Eq.  (2-23)  predicts  that  the  error  bound  decreases.  This  Is 
equivalent  to  sampling  at  greater  than  Nyqulst  rate  In  the  measurement 
array  In  the  Fourier  domain.  The  well-known  decrease  Is  known  as 
compression  gain. 

It  Is  known  that  current  Iterative  phase  retrieval  algorithms  are 
more  successful  In  converging  to  a  solution  for  some  object  support 
constraints  than  for  others  (e.g.,  for  a  triangularly-shaped  pattern 
Imposed  by  w,  the  algorithm  more  readily  finds  a  solution  than  for  a 
square  pattern)  [2.8],  By  a  solution  Is  meant  an  object  estimate  that 
Is  as  close  to  agreeing  with  the  measured  data  and  the  a  priori 
constraints  as  possible.  In  some  cases,  an  algorithm  stagnates  and 
produces  an  output  In  poor  agreement  with  the  data  and  constraints;  such 
an  output  should  not  be  considered  an  object  estimate.  If  there  Is  more 
than  one  solution  that  closely  agrees  with  the  data  and  constraints,  the 
algorithm  may  find  a  solution  that  Is  different  from  the  true  object. 
There  Is  a  tendency  for  Iterative  transform  algorithms  to  find  solutions 
more  readily  for  cases  guaranteed  to  have  unique  solutions  (e.g., 
objects  with  triangular  support  constraints).  However,  when  the 
solution  Is  unique,  we  have  demonstrated  that,  If  a  solution  Is  found 
(l.e.,  the  algorithm  does  not  stagnate  In  poor  agreement  with  the  data 
and  constraints),  then  the  mean-squared  error  Is  Independent  of  the 
shape  of  the  object  support  constraint  [2.9].  From  either  Eq.  (2-20)  or 


(2-23),  It  can  be  seen  that.,  for  a  given  value  of  Ejlwjl2,  the  lower 
bound  (egm)2  depends  only  on  I wml 2  and  not  on  the  two  dimensional 
distribution  of  w  (the  support  constraint).  The  Cramer-Rao  lower  bound 
Is  apparently  a  measure  of  the  error  of  algorithms  which  have  found  a 
reasonably  good  estimate  and  Is  Insensitive  to  lack  of  uniqueness  or  to 
algorithm-dependent  problems  such  as  stagnation. 

2.5  COMPARISON  OF  LOWER  BOUND  TO  CURRENT  ALGORITHM  PERFORMANCE 

Because  It  Is  of  Interest  to*  compare  the  Cramer-Rao  error  lower 
bound  to  phase  retrieval  algorithm  performance,  computer  simulation 
experiments  were  performed.  Uncorrelated,  Gaussian  distributed  random 
numbers  with  zero  mean  and  unit  variance  were  generated  and  used  as  the 
real  and  Imaginary  parts  of  an  M  by  M  object  array  f.  This  array  was 
multiplied  by  a  binary  weighting  array  w  with  a  triangular  shape.  The 
result  was  zero-filled  to  array  size  2M  by  2M,  Fourier  transformed,  and 
magnitude-squared  to  give  the  2M  by  2M  Fourier  Intensity  array  I, 
Additional  uncorrelated,  Gaussian  distributed  random  numbers  with  zero 
mean  were  generated  and  used  as  the  noise  array  N.  Their  variance  was 
varied  to  achieve  different  values  of  the  SNR  [see  Eq.  (2-21)].  Without 
loss  of  generality,  the  constant  c  was  set  equal  to  unity.  I  and  N  were 
summed  to  give  the  Fourier  Intensity  measurement  S. 

An  Iterative  Fourier  transform  algorithm  with  a  combination  of  the 
hybrid  Input-output  and  error-reduction  approaches  was  used  to  estimate 
the  object  f  [2.1,  2.2].  In  the  object  domain,  the  constraint  that  no 
energy  fall  out  outside  the  binary  support  constraint  w  was  used.  The 
algorithm  was  stopped  when  It  had  converged  to  a  solution  (after  about 
250  to  1000  Iterations).  The  triangular  support  was  used  to  minimize 
any  algorithm  stagnation  problems  [2.8].  In  general,  the  complex-valued 
estimate  f  will  differ  from  the  object  f  by  a  constant  phase.  This 
phase  Is  the  angle  between  the  vectors  f  and  ?  and  was  determined  by 


taking  the  dot  product  between  the  two  vectors.  After  eliminating  this 
phase  difference,  the  estimation  error  e,  where 
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was  then  computed. 

One  series  of  computer  simulations  used  M  equal  to  64  and  a  right 
triangular  support  constraint  with  base  and  height  equal  to  64.  The 
Initial  estimate  of  the  object  Input  to  the  Iterative  algorithm  was  an 
array  of  complex-valued  random  numbers  whose  real  and  Imaginary  parts 
were  uncorrelated  and  Gaussian  distributed  with  zero  mean  and  unit 
variance.  Figure  2-2  plots  the  estimation  error  e  versus  the  SNR  and, 
for  comparison,  a  corresponding  plot  of  the  square  root  of  the 
normalized  Cramer-Rao  lower  bound,  e0/fff  [see  Eq.  (2-23)],  As  shown,  In 
a  log-log  plot,  most  of  the  estimation  errors  lie  along  a  line 
approximately  parallel  to  the  lower  bound  curve.  This  behavior  holds 
out  to  a  SNR  of  1014  at  which  point  the  estimation  error  Is  about  10‘7 
and  the  finite  precision  of  the  computer  Is  reached. 

Another  series  of  simulations  used  the  same  support  constraint,  but 
the  Initial  estimate  was  the  object  to  be  estimated.  Although  this 
starting  point  cannot  be  used  In  any  practical  application,  It  Is  useful 
In  determining  the  best  possible  performance  obtainable  with  the 
algorithm.  In  this  case,  estimation  error  Increases  In  the  early 
Iterations  and  the  algorithm  converges  to  a  solution  consistent  with  the 
noisy  Fourier  Intensity  measurements.  The  estimation  errors  were 
slightly  lower,  but  nearly  Identical  to  those  obtained  with  a  random 
Initial  estimate.  A  third  series  of  simulations  used  M  equal  to  64  and 
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a  right  triangular  support  constraint  with  base  and  height  equal  to  16. 
Aialn,  similar  behavior  was  found,  but  the  separation  between  the  line 
of  estimation  error  and  the  lower  bound  curve  Increased. 

These  results  encourage  the  use  of  estimation  theoretic  lower 
bounds  in  the  study  of  phase  retrieval  and  phase  retrieval  algorithms, 
the  lower  bound  did  not  exceed  the  simulation  errors,  thus  giving 
additional  confidence  In  the  theoretical  developments  of  Sections  2.3 
and  2.4.  The  slope  of  the  estimation  error  line  Is  very  similar  to  that 
of  the  lower  bound  curve,  thus  Indicating  the  lower  bound  can  be  used  to 
predict  relative  performance  of  the  Iterative  Fourier  transform 
algorithm  versus  SNR.  It  Is  likely  both  that  a  greater  lower  bound  can 
be  computed  (the  Cramer-Rao  bound  Is  not  necessarily  a  greatest  lower 
bound)  and  that  an  improved  algorithm  could  reduce  the  estimation  error. 

2.6  A  SECOND  PHASE  RETRIEVAL  PROBLEM 

A  second  phase  retrieval  problem  of  Interest  Is  one  In  which  noisy 
complex-valued  measurements  of  the  Fourier  transform  Including  a  random 
(l.e.,  degraded)  phase  are  made,  rather  than  real -valued  measurements  of 
the  Fourier  Intensity.  Using  the  notatlonal  conventions  of  Section  2.2, 
the  measurement  model  Is 

Sp  »  fc  Fp  exp(1^p)  +  Np  (2-25) 

where 
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the  phase  0p  Is  a  random  variable*  the  noise  Np  Is  complex-valued,  and, 
unlike  Section  2.2,  p  «  (plt  p2);  pj,  p2  »  0,  1,...,  M-l  (not  2M-1) 
since  the  measurements  are  complex-valued. 

The  method  of  Section  2.3  can  be  applied  with  the  same  assumptions 
as  to  the  statistics  of  fffl  and  Np  (with  E[INpl2]  «  (aN)2)  and  the 
relation 
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If  It  Is  assumed  that  the  phase  j1p  Is  uniformly  distributed  from  0 
to  Zw  (modeling  a  complete  loss  of  phase  Information  In  the  measurement 
of  Fp),  then  (as  shown  In  Appendix  B)  the  lower  bound  (eQm)2  on  the 
mean-squared  error  In  the  estimate  of  f  Is 
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and  lo  and  Ij  are  modified  Bessel  functions  of  the  first  kind.  Note 
that  Eq,  (2-29)  Is  similar  in  form  to  Eq.  (2-23).  Much  of  the  discus¬ 
sion  at  the  end  of  Section  2.4  Is  therefore  also  applicable  here.  Fur¬ 
ther  Investigation  of  this  second  phase  retrieval  problem  (such  as 
evaluation  of  the  parameter  a)  was  not  undertaken. 


2.7  CONCLUSION  AND  SUGGESTIONS  FOR  FURTHER  RESEARCH 


In  this  Investigation  of  the  application  of  estimation  theoretic 
lower  bounds  to  phase  retrieval  and  Image  reconstruction  problems,  the 
Cramer-Rao  lower  bound  on  the  mean-squared  error  In  the  object  estimate 
from  Fourier  Intensity  measurements,  given  additive  noise,  Gaussian 
statistics,  and  Nyqulst  sampling,  was  found.  The  lower  bound  approaches 
the  appropriate  values  In  the  limits  of  high  and  low  SNR.  Comparison  of 
the  lower  bound  to  computer  simulations  showed  that  current  Iterative 
Fourier  transform  algorithm  performance  parallels,  but  does  not  achieve 
the  lower  bound.  The  lower  bound  for  the  error  1  ri  estimation  from 
complex-valued  Fourier  measurements  with  random  phase  was  also  found  and 
Is  of  a  similar  form  to  that  of  the  Fourier  Intensity  bound.  Further 
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research  should  Investigate  other  measurement  models  (e.g.,  Fourier 
magnitude  measurements)  object  domain  constraints  (e.g.,  nonnegativity), 
statistical  assumptions  (e.g.,  Poisson  noise),  and/or  other  estimation 
theoretic  lower  bounds. 
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SECTION  3 

UNIQUE  CLOSED  FORM  RECONSTRUCTION  ALGORITHM 
3.1  INTRODUCTION 

Since  the  object's  autocorrelation  function  can  be  computed  from  the 
modulus  of  Its  Fourier  transform,  reconstructing  the  object  from  Its 
autocorrelation  Is  equivalent  to  reconstructing  It  from  the  modulus  of 
Its  Fourier  transform.  In  an  earlier  effort,  It  was  shown  that  a  unique 
closed-form  algorithm  for  reconstructing  an  object  from  Its 
autocorrelation,  which  operated  In  a  recursive  fashion,  was  possible  for 
two  very  special  kinds  of  sampled  objects:  those  fitting  within  a 
rectangle  with  an  additional  point  off  one  corner  of  the  rectangle  and 
those  fitting  within  a  triangle  having  nonzero  corners.  This  earlier 
result  has  been  vastly  generalized  to  Include  sampled  objects  having 
supports  whose  convex  hulls  have  no  parallel  sides,  a  very  large  class 
of  objects.  This  generalized  algorithm,  which  Includes  a  uniqueness 
proof,  summarized  In  Appendix  C  and  Is  described  In  detail  In  Appendix 
D. 

Experimental  reconstruction  results  obtained  using  the  algorithm  are 
shown  In  Section  3.2.  Although  the  present  form  of  the  algorithm  Is 
very  sensitive  to  noise,  limiting  its  practical  use,  it  has  proven  to  be 
very  valuable  In  that  It  suggests  useful  Illumination  pattern  (support) 
constraints,  as  Is  demonstrated  In  Section  4.1.  Another  problem  with 
this  reconstruction  algorithm  Is  that  It  explicitly  assumes  a  sampled 
object,  l.e.  one  consisting  of  an  array  of  delta  functions,  and  It 
cannot  In  Its  present  form  be  employed  for  real-world  continuous 
objects.  In  Section  3.3  an  example  of  how  we  would  experience 
difficulty  In  Interpreting  the  uniqueness  of  the  solution  for  the 
continuous-object  case  Is  shown.  One  possible  way  around  this  problem 
Is  to  use  the  quasi -sampl Ing  method  suggested  In  Section  3.4. 


3.2  EXPERIMENTAL  CLOSED-FORM  RECONSTRUCTION  RESULTS 


Autocorrelation  data  was  computer-simulated,  Including  the  effects 
of  noise,  and  Images  were  reconstructed  using  the  closed-form 
reconstruction  algorithm  described  in  section  3.1  and  Appendices  C  and 
D. 


For  each  reconstruction  experiment  an  object  f(x,y),  fitting  within 
a  triangular  support  was  Fourier  transformed: 


F(u,v)  -  $tf(x,y)]  (3-1) 

The  Intensity  (squared  modulus),  I F(u, v) 1 2 (  of  the  Fourier  transform  was 
computed,  and  It  was  scaled  In  Intensity  so  that  the  total  Integrated 
Intensity  became  equal  to  a  given  number  of  photons, 


Nd  -  Z  IF(u,v)l2. 
H  uv 


(3-2) 


Then  each  Intensity  sample,  IF(u,v)l2,  was  replaced  with  a  random 
number,  IFn(u,v)!2,  drawn  from  a  Poisson  distribution  with  mean  and 
variance  equal  to  I F(u, v) I 2.  When  IF(u,v)l2  Is  a  large  number  (*32), 
then  a  Gaussian  approximation  to  the  Poisson  distribution  Is  used.  This 
Poisson  noise  process  simulates  the  effect  of  photon  (shot)  noise  on  the 
measured  Fourier  Intensity  data.  The  normalized  RMS  error  (NRMSE)  of 
the  data  Is  given  by 


E 


IFn' 


Z  [lFn(u,v)l  -  IF(u,v)l]2 

Z  IF(u.v) I2 
uv 


1/2 


(3-3) 


A  noisy  autocorrelation  was  computed: 

rn(x,y)  •$F“1[lFn(u,v)l2];  (3-4) 


fa: 


32 


and  an  Image,  gn(x,y),  was  reconstructed  using  the  closed-form 
reconstruction  algorithm.  The  NRMSE  of  the  reconstructed  Image  Is  given 
by 


Z  lag  (x.y)  -  f(x,y)l 
xv  " _ 


Z  ‘f (X.y) r 

xy 


211/2 


(3-5) 


where  a  Is  a  constant  chosen  to  minimize  the  error  metric,  which 
accounts  for  the  unknown  phase  constant  associated  with  f(x,y).  It  can 
be  shown  that  the  value  of  a  that  optimizes  the  Image  NRMSE  Is 


Z  f(x,y)fl*(x,y) 

- x - 

Z  ifl(x,y)r 

xy 


(3-6) 


Examples  of  Images  of  objects  reconstructed  from  noisy  data,  for  which 
the  object  Is  a  uniform  triangle,  are  shown  In  Figures  3-1  to  3-3  for 
various  sizes  of  the  object.  Figure  3-4  plots  the  Image  NRMSE  versus 
the  data  NRMSE  for  the  Images  shown  In  Figures  3-1  and  3-2.  Several 
Interesting  effects  are  evidenced  from  these  reconstruction  examples. 
First,  the  closed-form  algorithm  Is  very  sensitive  to  noise.  A  fraction 
of  a  percent  error  In  the  data  results  In  several  percent  error  In  the 
Image.  Second,  Increased  data  error  results  In  Increased  Image  error, 
but  only  In  an  average  sense.  Occasionally  the  Image  error  can  be 
greater  when  the  data  error  Is  less,  because  for  a  given  amount  of  data 
error  the  Image  error  that  one  gets  Is  highly  variable.  Depending  on 
the  particular  realization  of  the  noise  In  the  data,  the  three  corner 
points  will  have  different  amounts  of  error.  Small  differences  In  the 
error  of  the  corner  points  can  yield  large  differences  In  the  error  of 
the  Image  since  the  corner  points  are  used  over  and  over  again  and  the 
error  from  them  propagates  and  Is  magnified  as  the  recursive  steps  build 
upon  one  another.  This  also  gives  rise  to  a  third  effect:  the  error 
for  the  interior  points  of  the  reconstructed  Image  Is  much  worse  than 
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FIGURE  3-1.  Qx  8  OBJECT  AND  IMAGES  RECONSTRUCTED  BY  THE  CLOSED-FORM 
RECURSIVE  ALGORITHM.  Number  of  photons  listed  In  the  Fourier  Intensity 
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FIGURE  3-4.  IMRMS  ERROR  OF  THE  RECONSTRUCTED  IMAGE  VERSUS  NRMS  ERROR  OF 
THE  DATA.  Crosses  are  for  the  8  x  8  triangles  and  squares  for  the  16  x 
16  triangles. 


the  error  of  the  edge  points  In  the  Image.  Fourth,  for  similar  reasons 
the  error  of  larger  Images  Is  far  worse  than  the  error  of  smaller 
Images.  Fifth,  a  stripe  pattern  parallel  to  each  of  the  edges  tends  to 
occur.  This  happens  for  the  following  reason.  Suppose  that  one  of  the 
three  reconstructed  corner  points  Is  brighter  than  It  should  be.  Then 
the  opposing  edge  of  the  Image,  the  computation  of  which  Involves 
division  by  the  corner  point,  will  tend  to  be  too  dark.  Then  the  next 
Inward  row  (or  column)  from  the  edge,  the  computation  of  which  Involves 
subtraction  of  terms  Involving  the  edge,  will  tend  to  be  too  bright, 
etc. 


How  badly  this  striping  effect  affects  the  interpretabl llty  of  an 
Image  was  tested  by  using  a  picture  of  an  airplane  as  the  object, 
Imbedded  In  a  32  by  32  triangle.  The  results  of  reconstruction 
experiments  from  noisy  data  using  the  closed-form  reconstruction 
algorithm  for  this  object  are  shown  In  Figures  3-5,  3-6  and  3-7.  As 
seen  from  Figure  3-5,  the  Image  can  still  be  discerned  through  the 
partially-obscuring  striped  pattern.  Therefore  the  intelligibility  of 
the  image  may  be  understated  by  the  Image  NRMSE.  Figure  3-6  shows  the 
Image  NRMSE  versus  the  total  number  of  photons  for  all  the  noise  values 
tried  for  this  object,  and  Figure  3-7  shows  the  same  information,  but  as 
a  function  of  data  NRMSE.  Once  again,  a  fraction  of  a  percent  error  In 
the  data  results  In  several  percent  error  In  the  reconstructed  Image. 

Since  It  was  suspected  that  the  corners  (vertices)  play  a  pivotal 
role  in  determining  reconstructed  Image  quality,  a  series  of  computer 
reconstruction  experiments  were  performed  with  the  same  object  modified 
to  have  brighter  or  dimmer  corners.  Figure  3-8  shows  reconstructed 
Images  for  the  108  photons  case  for  an  object  with  all  three  corners 
multiplied  by  the  same  factor,  for  a  variety  of  factors.  Figure  3-9 
shows  a  plot  of  the  NRMS  error  of  the  Images  as  a  function  of  the  corner 
multiplier.  When  the  corner  multiplier  Is  unity  (1,e.  no  modification), 
the  NRMS  error  of  the  Image  ranged  between  about  0.06  to  0,15  (depending 
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Top  left:  noise  free  object;  Others:  reconstructed  Images  —  number  of 
photons  (left  to  right):  Row  1:  106,  10s ;  Row  2:  107,  107,  10s ,  108;  Row 
3:  109,  109,  1010,  10l°;  Row  4:  10^ 1 ,  1011,  101?,  lO1?.  Note  108  -*• 
0.0107  NRMSE. 
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FIGURE  3-7.  CLOSED-FORM  RECONSTRUCTION  ERROR  VS  DATA  ERROR. 
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FIGURE  3-8.  CLOSED-FORM  RECONSTRUCTION  CHANGING  ALL  THREE  CORNERS,  108 
PHOTONS.  Reconstructed  images  --  factor  multiplying  corners  (left  to 
right);  Row  1:  64,  32,  16,  8;  Row  2:  4,  2,  1,  1;  Row  3:  0.9,  0.9,  0.85, 

0.8;  Row  4:  0.75,  0.71,  0.5,  0.25. 
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FIGURE  3-9.  CLOSED-FORM  RECONSTRUCTION  ERROR  VS  FACTOR  MULTIPLYING 
THREE  CORNERS,  FOR  10^  PHOTONS. 


on  the  realization  of  the  noise  process).  For  larger  values  of  the 
corner  multiplier  the  error  decreased  substantially,  and  for  smaller 
values  It  Increased  dramatically.  Similar  results  are  shown  In  Figs.  3- 
10  and  3-11  for  the  case  of  10  photons  (which  are  noisier  than  the  10 
photons  case,  as  expected). 

Figures  3-12  through  3-15  show  the  reconstruction  results  when  only 
a  single  corner  (the  bottom  corner)  of  the  object  Is  modified.  These 
results  show  that  when  a  single  corner  Is  brighter,  then  the  portion  of 
the  reconstructed  Image  opposite  the  brighter  corner  Is  reconstructed 
with  greater  fidelity  than  the  rest  of  the  Image.  As  the  brightness  of 
a  single  corner  Increases,  at  first  the  quality  of  the  entire  image 
Improves  somewhat,  but  eventually  the  quality  of  the  Image  as  a  whole 
degrades  when  the  corner  becomes  too  bright.  A  possible  explanation  for 
this  Is  that  as  the  one  corner  becomes  very  bright,  the  other  corners 
have  an  Increasingly  smaller  fraction  of  the  total  energy,  which  causes 
errors  in  the  parts  of  the  reconstructed  Image  opposite  them. 

From  the  results  above,  we  can  see  that  when  the  corners  are  as 
bright  as  the  rest  of  the  object,  then  the  closed-form  recursive 
algorithm  Is  highly  sensitive  to  noise;  and  If  the  corners  are  dimmer, 
then  It  gets  much  worse.  This  is  particularly  damaging  because 
realistic  Illumination  patterns  will  have  tapered  edges  and  corners, 
making  the  corners  much  dimmer  than  the  rest  of  the  object. 
Consequently,  performance  of  the  closed-form  recursive  algorithm  would 
be  extremely  poor  If  the  support  constraint  Is  formed  by  a  tapered 
Illumination  pattern. 

3.3  CONTINUOUS-SPACE  TRIANGULAR  SUPPORT 

The  closed-form  recursive  algorithm  includes  a  proof  of  uniqueness 
for  triangular  objects  of  known  support  defined  on  a  rectangular  grid  of 
points  (l.e,  sampled).  It  does  not  immediately  follow,  however,  that  In 
the  continuous  world  triangular  objects  of  known  support  are  unique. 
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FIGURE  3-10.  CLOSED-FORM  RECONSTRUCTION  CHANGING  ALL  THREE  CORNERS,  10' 
PHOTONS.  Reconstructed  Images  —  factor  multiplying  three  corners  (left 
to  right):  Row  1:  64,  32,  16,  8;  Row  2:  4,  2,  1,  0.707. 
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FIGURE  3-11.  CLOSED-FORM  RECONSTRUCTION  ERROR  VS  FACTOR  MULTIPLYING 
THREE  CORNERS,  FOR  10?  PHOTONS. 
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FIGURE  3-12.  CLOSED-FORM  RECONSTRUCTION  CHANGING  BOTTOM  CORNER,  lo8 
PHOTONS.  Reconstructed  Images  -  factor  multiplying  corner  (left  to 
right):  Row  1:  64,  32,  16,  8;  Row  2:  4,  2,  1,  1:  Row  3:  0.9,  0.8,  0.71, 
0.65;  Row  4:  0.65,  0.55,  0.5,  0.25. 
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FIGURE  3-13.  CLOSED-FORM  RECONSTRUCTION  ERROR  VS  FACTOR  MULTIPLYING 
BOTTOM  CORNER,  FOR  108  PHOTONS. 
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FIGURE  3-14.  CLOSED-FORM  RECONSTRUCTION  CHANGING  BOTTOM  CORNER,  10? 
PHOTONS.  Reconstructed  Images  --  factor  multiplying  corner  (left  to 
right):  Row  1:  32,  16,  8,  4;  Row  2:  2,  1,  0.71,  0.5. 


CONNER  CllflNGE  MULTIPLIER 


FIGURE  3-15.  CLOSED-FORM  RECONSTRUCTION  ERROR  VS  FACTOR  MULTIPLYING 
BOTTOM  CORNER,  FOR  107  PHOTONS. 
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Consider  the  following  two  objects! 

'  -x-y, 

for  o  i  y  i  1 

and 

0  <  x  £  1  -  y, 

f^x.y)  «  1 

2+x-y, 

for  0  i  y  i  1 

and 

y-1  i  x  i  0, 

.  o, 

otherwise 

‘  2 -x-y, 

for  0  i  y  i  1 

and 

0  i  x  i  1-y 

f2(x,y)  ■  < 

x-y, 

for  0  $  y  i  1 

and 

y-l  i  x  <  o 

•  o, 

otherwise  . 

Both  objects  have  the  same  triangular  support 

T  ■  {(x,y)s  0  i  y  i  i  and  Ixl  4  1  -  y). 

The  objects'  Fourier  transforms  are 

F^u.v)  -  (1  +  1u)  g(u,v) 
and 

F2(u,v)  •  (1  -  1u)g(u,v) 

where 

g(u,v)  -  e"1v  J  [t  s1nc(ut/2)]2  e1vtdt. 

Therefore  IFi(u,v)l  -  I Fg (u, v) I  ■  (l+u2)  I g (u t v) 1 2 .  Thus  we  have  two 
different  objects  with  the  same  support  and  the  same  Fourier  modulus, 
contrary  to  our  notion  that  triangles  should  be  unique.  [Although 
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f2(x,y)  “  (-X ,y) t  they  are  not  equivalent  solutions  with  the  sense 

that  f j (x,y)  and  f^C-x.-y)  are]. 

Note,  however,  that  this  Is  not  quite  analogous  to  the  discrete 
triangle  case.  Since  f^x.y)  approaches  + 1  as  the  upper  corner  (0,1)  Is 
approached  from  the  lower  left,  but  approaches  -1  as  the  upper  corner  Is 
approached  from  the  lower  right,  we  could  consider  the  upper  corner  to 
have  a  value  equal  to  the  average  of  those  two  limiting  values,  or  zero. 
But  It  was  assumed  for  the  discrete  model  that  the  corners  were  nonzero. 
Even  though  the  analogy  Is  not  exact,  it  Is  close,  and  this  example 
Illustrates  the  difficulty  In  translating  the  discrete-model  results  to 
the  continuous  case,  which  Is  the  physical  case  of  Interest. 

3.4  QUASI-SAMPLING  ILLUMINATION  PATTERN 

One  problem  with  the  closed-form  reconstruction  algorithm  Is  Its 
reliance  on  the  object  being  sampled.  In  this  section,  we  show  that  by 
a  special  kind  of  Illumination  we  can  approximately  achieve  the  desired 
sampling  effect  In  the  target  area. 

If  one  Illuminates  a  target  area  with  four  mutually  coherent  point 
sources  in  the  far  field,  at  a  distance  R,  given  by 

<5(u  -  uQ,  v  -  vQ)  +  <5(u  -  uQl  v  +  vQ)  +  ff(u  +  uQ,  v  -  vo) 

+  <J(u  +  uQ,  v  +  vQ), 

we  get  a  field  at  the  target  area  given  by  the  sum  of  four  plane  waves 
(Fourier  transforms  of  the  delta  functions): 

UR)-1  {exp  [^(U0x  +  W0y)]  +  exp  [^§*(u0x  -  vQy)] 
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+  exp  +  vQy)]  +  exp  [^('V  -  vQy)]} 


(XRj-^exp^  u,x]  +  exp(^|S  u0x)].[exp  ({f1  v0y)  +  exp(^  v)] 


<  /2irU.X\ 

4(XR)  cos  (-££-)  cos  (-5^“] 


f2fvnyx 


which  has  Intensity 


16(XR)"2 


cos 


which  has  lines  of  zeros  along  x  *  XR(n  +  1/2) / (2u0)  and  along  y  => 
XR(n+l/2)/(2u0) ,  for  n  ■  0,  +1,  +2,...  . 


The  Illumination  pattern  would  be  of  limited  extent  which  could  be 
modeled  by  multiplying  the  above  by  a  slowly  varying  weighting  function 
defining  the  fleld-of-vlew,  (XR/4)t(x,y) ,  so  that  the  entire 
Illumination  pattern  Is 


w(x,y) 


t(x,y)  cos 


r2*u0xT 
.  XR  J 


cos 


where  there  are  a  number  of  cycles  of  the  cosines  over  the  extent  of 
t(x,y). 


What  Is  accomplished  by  this  is  a  quasi -sampling  of  the  object.  It 
may  be  possible  to  use  the  closed-form  recursive  reconstruction 
algorithm  to  reconstruct  an  object,  Illuminated  by  w(x,y)  above,  from 
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Its  autocorrelation,  or  It  may  be  necessary  to  make  modifications  to 
reduce  the  errors  due  to  approximating  this  pattern  by  a  true  sampling 
pattern. 

Note  that  by  the  addition  of  more  plane  waves  It  Is  possible  to  get 
sharper  local  maxima  and  broader  stripes  of  low  Intensity,  but  at  the 
expense  of  a  more  complicated  Illumination  system  with  phase-stability 
problems  of  Its  own. 

In  the  real  world,  the  four  mutually  coherent  illumination  sources 
may  have  an  unknown  relative  phasing  between  them.  If  the  constant 
relative  phases  of  the  four  sources  are  fa,  fa  and  ^4,  then  the 
product  of  cosines  like  the  equation  above  occurs  only  If  fa  -  fa  -  fa  + 
fa  ■  0.  This  Implies  a  stringent  stability  requirement  on  the 
Illumination  system,  but  It  requires  the  control  of  only  a  single 
parameter  (one  piston  term)  rather  than  the  control  of  the  phase  of  an 
entire  large  aperture. 


SECTION  4 

CONSTRAINT  INVESTIGATION 

In  this  section  the  various  forms  of  constraints  that  might  be 
useful  for  phase  retrieval  are  discussed.  Section  4.1  describes  results 
obtained  with  a  variety  of  support  (illumination  pattern)  constraints. 
The  vast  majority  of  the  effort  to  date  concentrated  on  developing 
Imaging  concepts  based  on  the  support  constraint.  Section  4.2  describes 
other  constraints  that  might  also  prove  useful. 

4.1  EFFECT  OF  ILLUMINATION  PATTERN  SHAPE 

The  support,  S,  of  an  object  Is  defined  as  the  set  of  points  for 
which  the  object  Is  nonzero.  For  the  case  of  a  satellite  imaged  against 
the  night  sky  or  a  ship  Imaged  on  calm  water  with  a  SAR,  the  support  of 
the  object  Is  basically  the  fllled-ln  outline  of  the  object.  For  an 
airborne  or  spaceborne  sensor  looking  downward  at  a  general  scene,  the 
extent  of  the  object  Is  basically  defined  by  the  field  of  view  of  the 
sensor.  This  latter  case  may  not  represent  a  useful  support  constraint. 
However,  for  an  Imaging  system  employing  active  Illumination,  the 
transmitted  beam  (the  Illumination  beam)  can  take  on  a  known  shape  at 
the  plane  of  the  target,  and  It  can  be  designed  to  occupy  an  area 
smaller  than  the  field  of  view  of  the  receiver.  Then  the  effective 
support  of  the  object  Is  the  support  of  the  Illumination  beam  pattern. 
For  the  case  of  a  SAR,  It  Is  assumed  that  when  no  phase  Is  available  the 
pulse  repetition  frequency  Is  at  least  twice  that  ordinarily  required  by 
Nyqulst  sampling  when  phase  Information  Is  available. 

The  two  most  Important  properties  of  an  Illumination  pattern  are  Its 
shape  (elliptical,  rectangular,  polygonal,  etc.)  and  Its  taper  (how 
slowly  It  transitions  from  the  bright  part  of  the  pattern  to  where  It  Is 
effectively  zero).  As  shown  In  the  proposal  [4.1,  p.  2-29],  phase 
retrieval  algorithms  are  much  more  effective  for  some  shapes  (which  we 
refer  to  as  strong  shapes)  than  for  others.  Furthermore,  phase 


retrieval  algorithms  are  more  effective  for  sharp  support  constraints, 
l.e.  when  there  Is  little  or  no  taper  to  the  Illumination  pattern  [4.1, 
p.  2-25].  Section  5  of  this  report  details  the  results  of  our 
Investigation  of  the  effects  of  tapered  Illumination  patterns  and  noise 
and  of  algorithm  Improvements  that  were  made  for  the  case  of  larger 
amounts  of  taper.  In  what  Immediately  follows  we  discuss  the  effects  of 
the  shape  of  the  support. 

In  early  phase  retrieval  work  there  was  not  an  awareness  that  the 
support  of  the  object  played  an  Important  role  In  the  success  of  phase 
retrieval.  Early  successful  reconstruction  results  were  for  space 
objects  whose  supports  were  naturally  non-centrosymmetrlc  [4.2].  Other 
groups  attempted  phase  retrieval  for  unnatural  objects  --  scenes  bounded 
by  squares  --  and  were  unsuccessful.  Flddy,  Brames  and  Dainty  [4.3] 
found  that  the  Iterative  Fourier  transform  algorithm,  although  It  worked 
poorly  for  a  rectangular  support,  worked  well  for  a  support  consisting 
of  a  rectangle  plus  an  extra  point  just  off  one  corner  of  the  rectangle. 
This  latter  support  has  the  special  property  that  any  sampled  function 
defined  on  that  support,  which  Is  nonzero  at  the  extra  point  and  at  one 
opposite  corner,  has  a  Fourier  transform  that  Is  a  nonfactorable 
polynomial  according  to  Elsensteln's  Irreduclblllty  theorem.  This 
Implies  that  the  phase  retrieval  problem  Is  unconditionally  unique  for 
objects  of  this  type.  In  retrospect,  from  those  results  we  can  make  the 
crucial  connection  between  three  different  aspects  of  the  phase 
retrieval  problem:  the  support  of  the  object,  the  uniqueness  of  phase 
retrieval,  and  the  success  of  the  Iterative  Fourier  transform  algorithm. 

The  trends  connecting  those  three  elements,  which  we  have  continued 
to  confirm,  are  the  following.  First,  the  support  of  the  object 
determines  whether  ambiguities  are  possible.  Second,  objects  for  which 
uniqueness  can  be  proven  are  easier  to  reconstruct  by  the  Iterative 
Fourier  transform  algorithm  than  are  other  objects.  The  first  trend  Is 


amply  demonstrated  In  Appendices  C  and  D  which  show  that  sampled  objects 
having  known,  convex  hulls  with  no  parallel  sides  are  unique.  The 
second  trend  Is  shown  by  the  reconstruction  results  [4.1,  pp.  2-24  and 
2-28]  In  which  objects  having  known  triangular  support  (which  are  unique 
—  see  Appendix  D)  and  objects  having  known  supports  with  separated 
parts  (which  even  In  one  dimension  are  usually  unique  —  see  examples  In 
Section  5)  are  easily  reconstructed  while  objects  with  other  support 
constraints,  like  that  of  a  single  ellipse  or  a  single  rectangle,  are 
difficult  to  reconstruct. 

The  closed-form  reconstruction  algorithm  described  In  Appendix  D  may 
not  be  practical  for  use  on  real-world  data,  since  It  requires  the 
objects  to  be  modelled  discretely  (as  a  grid  of  delta  functions  or 
sampled  points)  and  it  Is  very  sensitive  to  noise,  particularly  If  the 
vertex  points  are  dim  (see  Section  3.2).  Nevertheless,  It  does 
constitute  a  uniqueness  proof  for  the  types  of  objects  to  which  It  can 
In  theory  be  applied:  objects  whose  support  has  a  convex  hull  with  no 
parallel  sides.  This  leads  us  to  consider  Illumination  patterns  of  this 
type.  Figures  4-1  and  4-2  show  examples  of  reconstruction  experiments 
using  Illumination-pattern  shapes  suggested  by  the  uniqueness  proof. 
Figure  4-1 (a)  shows  the  modulus  of  a  complex-valued  SEASAT  SAR  Image 
multiplied  by  a  binary  pattern  (representing  the  Illumination  pattern) 
In  the  shape  of  a  triangle.  Figure  4-1 (b)  shows  the  modulus  of  Its 
Fourier  transform  (the  corresponding  signal  history)  (the  Fourier  phase 
was  discarded).  The  Iterative  Fourier  transform  algorithm  was  used  to 
reconstruct  an  Image,  the  modulus  of  which  Is  shown  In  Figure  4-1 (c), 
from  the  Fourier  modulus  using  the  known  support  pattern.  The  result  is 
an  excellent  reconstruction  after  only  160  Iterations.  The  example  of 
Figure  4-2  Is  similar,  except  that  a  pentagon-shaped  Illumination 
pattern  and  support  constraint  were  used.  The  result  shown  In  Figure  4- 
2(d)  Is  after  2990  Iterations,  when  It  was  still  in  the  process  of 
slowly  coverging  toward  the  solution.  At  this  point  It  strongly 
resembles  the  correct  object  but  requires  more  Iterations  for  complete 


FIGURE  4-1.  RECONSTRUCTION  EXPERIMENT  EMPLOYING  TRIANGULAR-SHAPED 
ILLUMINATION  PATTERN,  (a)  (upper  left)  modulus  of  Illuminated  object, 
(b)  (lower)  modulus  of  signal  history,  the  Fourier  transform  of  the 
object;  (c)  (upper  right)  modulus  of  Image  reconstructed  from  (b)  using 
the  Iterative  transform  algorithm  with  a  triangular  support  constraint. 
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FIGURE  4-2.  RECONSTRUCTION  EXPERIMENT  EMPLOYING  PENTAGON-SHAPED 
ILLUMINATION  PATTERN,  (a)  (upper  left)  illuminated  object;  (b)  (upper 
right)  modulus  of  signal  history;  (c)  (lower  left)  partially 
reconstructed  Image  after  890  Iterations;  (d)  (lower  right)  partial ly 
reconstructed  Image  after  2990  iterations. 


convergence.  Given  the  difficulty  In  reconstructing  complex-valued 
Images  with  contiguous  supports  (with  the  exception  of  triangular 
support)  [4.1,  pp.  2-24  to  2-30],  the  success  of  this  kind  of  support 
constraint  would  have  been  difficult  to  anticipate  were  It  not  for  the 
uniqueness  proof.  Comparison  of  Figures  4-1  and  4-2  shows  that  the 
further  the  sides  are  from  being  parallel,  the  better.  Triangular 
supports  are  probably  optimal  for  polygonal  supports. 

Iri  summary,  to  date  the  support  constraints  found  to  be  most  useful 
for  phase  retrieval  are  (a)  supports  having  two  or  more  well -separated 
parts  and  (b)  supports  having  convex  hulls  with  no  parallel  sides.  The 
farther  the  sides  are  from  being  parallel,  the  better. 

4.2  OTHER  CONSTRAINTS 

Any  Information  that  Is  In  a  domain  other  than  the  domain  of  the 
measured  data  has  the  potential  for  being  a  useful  constraint  for  phase 
retrieval.  Constraints  In  the  domain  of  the  measured  data  usually  just 
limit  the  available  data  and  tend  not  to  be  useful  for  phase  retrieval. 
Candidate  constraints  for  the  phase  retrieval  problem  are  listed  In 
Table  4.1, 


Table  4-1 

CANDIDATE  CONSTRAINTS 

Support  (Illumination  pattern) 

Nonnegativity 

Polarization 

Transmitted  waveform 

Point  scatterer  In  scene 

Other  scene  characteristics 


Of  these,  the  support  constraint  received  the  most  attention,  and  It  Is 
discussed  In  Section  4.1.  The  other  constraints  are  described  In  what 
follows. 


Nonnegatlvlty 

The  nonnegativity  constraint  has  been  very  useful  In  previous  phase 
retrieval  efforts  [4.2]  and  also  In  other  Image  reconstruction  problems, 
such  as  tomographic  reconstruction  from  Incomplete  projections  and 
constrained  deconvolution.  Unlike  the  support  constraint,  nonnegativity 
must  exist  naturally  —  we  do  not  know  how  to  Impose  it  artificially. 
It  naturally  occurs  In  most  passive,  noncoherent  Imaging  scenarios.  The 
brightness  distribution  of  an  Incoherently-Illuminated  reflecting  object 
or  a  self-emissive  object  Is  characterized  by  a  real,  nonnegative 
function  (power  or  photons  per  unit  area).  This  can  be  true  for 
actively  Illuminated  objects  as  well,  as  long  as  the  Illumination  Is 
sufficiently  Incoherent.  An  exception  In  which  this  constraint  may  not 
be  valid  Is  for  passive  Doppler  Imaging  as  encountered  In  the  PACE 
program  [4.4],  for  which  the  aperture  function  Is  band-pass. 
Nonnegativity  Is  not  as  useful  for  band-pass  systems  since  the  Impulse 
response  has  very  large  negative  (or  complex-valued)  sldelobes  which 
convolve  the  Image,  destroying  Its  nonnegativity.  For  the  cases  In 
which  nonnegativity  naturally  does  occur,  It  should  be  relied  on  heavily 
as  a  phase  retrieval  constraint. 

Polarization 

Certain  kinds  of  reflecting  objects  have  distinctly  different 
reflectivities  for  the  two  different  receive  polarities  (l.e.  for  same 
polarity  as  transmit  or  for  opposite  polarity).  As  an  example,  corner 
reflectors  reflect  either  very  strongly  or  very  weakly  depending  on  the 
polarization.  Unfortunately,  from  a  single  collection  It  is  not 
Immediately  obvious  how  to  utilize  this  Information.  On  the  other  hand, 


If  two  collections  are  made  simultaneously,  one  for  each  polarization, 
then  there  Is  Increased  possibility  of  using  polarization 
advantageously.  One  such  possibility  would  be  to  use  the  difference 
between  two  degraded  Images  (with  measured  Fourier  phase  In  the  presence 
of  phase  errors)  to  Identify  point-like  reflectors.  Then  the  point-like 
reflectors  could  be  used  In  the  prominent-point  processing  described 
below.  Other  examples  of  using  polarization  may  be  also  be  possible. 

Transmitted  Waveform  Type 

Early  on  In  the  program  It  was  thought  that  perhaps  transmitting  a 
pulse  with  missing  frequency  bands  might  be  useful  Insofar  as  It  would 
constitute  a  support-like  constraint  In  the  signal  history.  Upon 
further  examination,  It  appears  that  such  missing  frequencies  would 
primarily  result  In  a  loss  of  data  rather  than  constituting  a  useful 
constraint. 

A  point  worth  making  relating  to  transmitted  wavefront  Is  that  the 
use  of  phase  retrieval  techniques  may  facilitate  the  use  of 
nonconventlonal  waveforms.  As  the  transmitted  waveform  departs  from  the 
standard  set  (e.g.  the  chirp  waveform),  the  availability  of  hardware 
that  can  form  the  desired  waveform  in  a  phase-stable  manner  may  be 
questionable.  By  reducing  the  tolerance  on  the  phase  stability  of  a 
waveform  generator  It  may  be  possible  to  achieve  waveforms  that  would 
otherwise  be  very  difficult  to  produce.  The  reduced  tolerance 
Imaging/phase  retrieval  techniques  may  provide  the  means  for  reducing 
the  phase  stability  of  the  waveform  generator  while  maintaining  the 
desired  resolution. 

Point  Scatterers  In  Scene 


Presently,  point-like  scatterers  (prominent  points)  In  the  target 
area  are  used  for  correcting  small  amounts  of  phase  errors  In  SAR  signal 
histories  [4.5,  4.6].  Prominent  point  processing  can  also  be  of  great, 
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utility  for  the  case  of  severe  phase  errors  or  when  no  phase  Information 
at  all  Is  measured.  One  particular  scenario  for  phase  correction  In  the 
presence  of  large  one-dlmenslonal  phase  errors  has  already  been 
demonstrated  [4.6].  For  the  case  of  motion  compensation  errors  In  SAR, 
one  has  a  one-dlmenslonal  (azimuth)  phase  error.  This  occurs 
particularly  for  the  case  of  Inverse  SAR,  for  example  the  radar  Is 
ground-based  and  the  (noncooperative)  target  flies  by  with  a  poorly 
known  flight  path  and  rotation.  If  there  exists  a  dominant  prominent 
point  scatterer  in  a  given  compressed  range  cell,  then  It  can  be  used  to 
calculate  the  azimuth  phase  error  (taking  Its  phase  to  be  the  phase 
error).  The  phase  errors  In  all  range  bins  can  be  corrected  by 
subtracting  that  phase. 

Other  Scene  Characteristics 


The  constraints  mentioned  above  are  common  to  large  classes  of 
Imagery.  Also,  there  may  often  be  additional  constraints  that  exist  In 
specific  Instances.  For  example,  If  the  scene  has  been  imaged, hy 
another  sensor  system  or  by  a  similar  sensor  at  an  earlier  time,  then 
these  additional  Images  may  contain  Information  that  can  be  counted  on 
to  appear  In  the  present  Image  and  therefore  can  be  used  as  an  a  priori 
constraint.  Examples  Include  the  known  existence  of  permanent  cultural 
targets  or  of  no-return  areas  such  as  lakes  or  smooth  surfaces. 
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SECTION  5 

RECONSTRUCTION  OF  OBJECTS  WITH  TAPERED  ILLUMINATION 


5.1  STATEMENT  OF  PROBLEM 


It  Is  well  known  that  knowledge  of  the  support  of  an  object  can  be  a 
powerful  source  of  Information  In  Image-reconstruction  problems.  By 
support  we  mean  a  compact  region  outside  of  which  the  object  Is  known  to 
be  zero,  and  we  denote  the  set  of  points  that  make  up  the  support  by  the 
symbol  S.  In  particular,  considerable  success  has  been  realized  In 
reconstructing  an  object  from  Its  Fourier  modulus  and  a  known  support 
[5.1 ,5.2].  In  the  reduced-tolerance  imaging  program  an  effort  Is  being 
made  to  exploit  this  ability. 


Consider  an  active  sensor  system  that  Illuminates  a  target  area  so 
that  the  Illumination  Is  confined  to  a  predetermined  region.  Let  h(x,y) 
be  the  complex  reflectivity  of  the  target: 

h(x,y)  -  ih(x,y) I  e  h  .  (5-1) 


Let  w(x,y)  be  the  complex  Illumination  fuctlon: 

,  \  ,  s  1#w(x,y) 

w(x,y)  -  lw(x,y) I  e 

We  define  the  effective  object  as  the  product  of  the  complex 
reflectivity  of  the  target  and  the  Illumination  function: 


(5-2) 


f(x,y)  •  w(x,y)  h(x,y) 

■  lf(x,y)  I  eWx,y).  (5-3) 

The  effective  object  will  now  have  a  support  corresponding  to  the  known 
extent  of  w(x,y).  The  Intensity  pattern  of  the  field  emanating  from  the 
Illuminated  target  Is  measured  In  the  far  field  which  may  be  Interpreted 


as  the  squared  modulus  of  the  Fourier  transform  of  the  effective  object. 
Known  phase-retrieval  algorithms  may  then  be  employed  to  reconstruct  the 
effective  object  from  the  support  constraint  and  the  measured  Fourier 
modulus. 


Notice  that  there  Is  some  freedom  In  the  choice  of  the  form  of  the 
Illumination  pattern.  For  example,  the  shape  of  the  outline  of  the 
pattern  could  be  specifically  selected  to  enhance  the  usefulness  of  the 
support  constraint.  It  is  known  that  certain  symmetries  In  object 
support  can  create  stagnation  problems  In  phase-retrieval  algorithms. 
Consequently  the  outline  of  the  Illumination  pattern  should  have  an 
asymmetric  shape.  Furthermore,  there  Is  some  evidence  that  a  support 
consisting  of  disjoint  regions  can  be  an  advantage  In  phase  retrieval. 
Finally,  It  Is  useful  to  choose  an  Illumination  function  with  a  constant 
modulus  over  most  of  the  region  of  Illumination  thus  facilitating  the 
Inversion  of  Eq.  (5-3) s 


h(x,y) 


(x,y)eS 


w(^yT 


(5-4) 


U?(x,yWw(x,y)) 

e 


when  one  desires  the  complex  reflectivity  of  the  target  without  the 
Influence  of  the  Illumination  pattern, 


Unfortunately,  the  modulus  of  the  Illumination  pattern  will  not  be 
binary  In  practice,  but  will  have  some  taper  associated  with  It  at  the 
edges,  due  to  the  effects  of  diffraction  by  the  aperture  of  the 
Illuminator.  The  contrast  between  an  Ideal  untapered  11 1  urnl nation 
pattern  and  a  more  realistic  Illumination  function  Is  Illustrated  In 
Figure  5-1.  Intuitively  we  might  expect,  and  experimentally  It  has  been 
shown  [5. 1,5, 2],  that  the  reconstruction  of  an  object  from  Its  Fourier 
modulus  and  support  would  be  more  challenging  for  an  object  with  a 


tapered  profile  than  for  one  with  a  sharp  profile.  It  was  the  purpose 
of  this  Inquiry  to  explore  this  Issue  via  computer  simulation  and  to 
look  for  algorithmic  modifications  that  would  enhance  restoration  for 
this  case.  For  example,  It  was  hoped  at  the  outset  that  any 
difficulties  Incurred  by  tapered  Illumination  might  be  offset  by  the 
support  being  disjoint. 

5.2  PRELIMINARY  SIMULATIONS 

We  began  by  exploring  the  effect  of  tapered  Illumination  on  phase 
retrieval  by  means  of  computer  simulation.  A  pair  of  disjoint  ellipses 
was  used  as  the  basic  shape  for  the  Illumination  pattern.  The  untapered 
Illumination  pattern  was  assigned  a  value  of  unity  within  the  ellipses 
and  zero  outside.  Taper  was  Introduced  by  convolving  the  binary 
ellipses  with  a  convolution  kernel.  The  normalized  kernels  used  In 
these  preliminary  simulations  are  shown  In  Figure  5-2.  Cross  sections 
of  the  edge  of  the  resulting  Illumination  patterns  are  given  In  Figure 
5-3. 

As  mentioned  earlier,  It  was  speculated  that  disconnected  support 
might  help  to  overcome  any  problems  associated  with  tapered 
Illumination.  For  this  reason  the  total  Illumination  pattern  was  chosen 
to  be  two  disjoint  ellipses.  Simulations  were  performed  for  objects 
with  differing  amounts  of  Illumination  taper  and  differing  amounts  of 
separation  between  ellipses  In  the  Illumination  pattern.  A  given 
simulation  was  performed  by  first  multiplying  complex  SEASAT  SAR  Imagery 
by  the  given  Illumination  pattern  to  create  an  effective  object. 
Because  It  Is  the  effective  object  that  we  try  to  recover  through  phase- 
retrieval  techniques,  we  will  henceforth  refer  to  this  as  the  true 
object.  This  object  was  Fourier  transformed  with  an  FFT  and  the  Fourier 
magnitude  was  retained.  The  known  region  of  support  In  the  object 
domain  was  supplied  by  hard  limiting  (thresholding)  the  Illumination 
pattern  with  a  very  small  threshold  value.  A  standardized  sequence  of 
error-reduction  and  hybrid  Input-output  Iterations  [5.3]  were  then 


FIGURE  5-2.  DISCRETE  CONVOLUTION  KERNELS  USED  TO  ADD  TAPER 
TQ  BINARY  ILLUMINATION  PATTERN.  A.  Center-weighted  kernel 
yields  taper  #1.  B.  Evenly-weighted  kernel  yields  taper  #2. 
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FIGURE  5-3.  CROSS  SECTIONS  OF  ILLUMINATION  OF  TAPER  USED  IN 
PRELIMINARY  SIMULATIONS 


performed  to  reconstruct  the  object  from  Its  Fourier  modulus  and 
support.  Convergence  for  all  simulations  was  monitored  by  calculating  a 
Fourier  domain  normalized  error  metric, 


Z  (lG{u,v)i  -  I  F(u,v)  l)2 

Ep  -  - - L 

F  H  I F(u, v) 1 2 

u,v 


(5-5) 


where  F(u,v)  Is  the  discrete  Fourier  transform  of  the  true  object  9 

f(x,y),  and  6(u,v)  Is  the  Fourier  transform  of  the  Image  estimate.  The  ■ 

convergence  Is  portrayed  In  Figure  5-4  for  six  kinds  of  Illumination—  ■ 

three  amounts  of  taper,  each  with  two  amounts  of  separation  between  1 

ellipses.  It  Is  Important  to  note  that  Figure  5-4  Is  a  log-log  plot  and  gj 

therefore  the  behavior  of  the  algorithm  becomes  horizontally  compressed 
with  Increasing  number  of  Iterations.  Figures  5-5,  5-6,  and  5-7  give 
the  final  reconstructions  for  each  of  the  cases  tested.  These  results 
confirm  our  expectation  that  Increased  amounts  of  Illumination  taper 
make  the  reconstruction  process  more  difficult.  In  fact,  for  the  case 
with  the  largest  amount  of  taper  the  algorithm  convergence  appears  to 
have  stagnated.  This  Is  In  spite  of  the  fact  that  the  amount  of  taper 
Is  extremely  mild.  There  are  51  pixels  along  the  major  axis  of  the 
large  ellipse  and  only  two  pixels  of  taper  at  the  edge.  Thus 
convergence  of  the  traditional  algorithm  appears  to  be  relatively 
sensitive  to  Illumination  taper.  It  Is  Important  to  realize  that  the 
convergence  curves  shown  in  Figure  5-4  correspond  to  a  specific  Initial 
estimate  and  that  the  convergence  behavior  could  vary  when  alternative 
Initial  estimates  are  used. 

5.3  THE  SHRUNKEN-MASK  ALGORITHM 

In  order  to  explore  the  reasons  for  stagnation  we  created  a 
difference  Image  between  the  modulus  of  the  true  object  and  that  of  the 
restored  object  for  the  case  of  Intermediate  taper  (taper  #1).  This 
difference  Image  Is  bipolar  and  a  bias  was  added  for  display  In  Figure 
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FIGURE  5-5.  RECONSTRUCTIONS  OF  OBJECTS  WITH  UNTAPERED  ILLUMINATION 
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5-8.  Notice  that  the  difference  Image  Indicates  that  the  reconstruction 
Is  shifted  In  the  horizontal  direction  relative  to  the  true  object. 
This  suggests  that  the  algorithm  may  be  stagnating  because  of  Its 
Inability  to  properly  register  the  reconstruction  relative  to  the 
support  constraint  when  tapered  Illumination  is  used. 

To  better  understand  this  conjectured  mode  of  stagnation  consider  an 
object  f(x,y)  with  tapered  Illumination.  We  define  a  binary  mask  m(x,y) 
that  Is  the  characteristic  function  of  S,  the  known  support: 

1,  (x,y)eS 

m(x,y)  ■  '  (5-6) 
■0,  (x.y)eS' 

where  S'  stands  for  the  complement  of  a  S.  An  Image  g'(x,y)  outputted 
by  the  Iterative  Fourier  transform  algorithm  Is  the  Inverse  Fourier 
transform  of  a  Fourler-domaln  estimate  having  modulus  equal  to  the  given 
Fourier  modulus  data  coupled  with  the  current  estimate  of  the  Fourier 
phase.  Suppose  the  output  Image  Is  just  a  shifted  version  of  the 
object: 

fl'(x.y)  -  f(x  -  x0,  y  -  y0).  (5-7) 

A  shift  In  the  object  domain  Introduces  a  linear  phase  factor  In  the 
Fourier  domain  and  has  no  effect  on  the  Fourier  modulus.  This  output 
Image  will  clearly  satisfy  the  Fourier  modulus  constraint.  The  output 
Image  has,  however,  been  shifted  relative  to  the  mask  so  that  the  object 
domain  support  constraint  has  been  violated.  In  other  words, 
multiplying  by  the  mask  function  will  crop  an  edge  of  the  output  Image. 
We  use  a  normalized  error  metric  to  Indicate  the  degree  of  Inconsistency 
between  an  estimate  and  the  object  support  constraint: 


FIGURE  5-8.  MODULUS  DIFFERENCE  BETWEEN  OBJECT  AND  RECONSTRUCTION. 
(Illumination  due  to  Taper  #2  In  Figure  5-3)  The  bipolar  difference 
Image  has  been  biased  up  for  display. 


where  m'(x,y)  Is  the  characteristic  function  of  S'.  If  the  shift  vector 
(x0fy0)  Is  small  with  respect  to  the  Illumination  taper  the  object 
domain  error  metric  will  also  be  relatively  small.  This  Is  because  only 
the  tapered  edges,  where  there  Is  little  energy,  will  be  cropped  and 
this  contributes  to  only  a  small  portion  of  the  total  object  energy. 

Though  the  cropped  output  Image  now  satisfies  the  support 
constraint,  Its  Fourler-transform  modulus  no  longer  exactly  equals 
I F (u , v) I .  It  can  easily  be  shown  that  the  Fourler-domaln  error  metric 
Is  also  small,  Thus  the  error  metric  penalty  Is  small  In  either  domain 
when  shifting  a  tapered  object  by  a  small  amount.  An  algorithm  that 
chooses  successive  estimates  based  upon  these  error  metric  objective 
functions  will  be  Insensitive  to  small  shifts  and  would  easily  stagnate 
due  to  extremely  small  slopes  In  the  objective  function.  Such  an 

algorithm  would  be  Ineffective  at  finding  the  proper  object 

registration.  Furthermore,  one  can  Imagine  that,  with  the  right, 
redistribution  of  the  cropped  object  energy,  an  object  estimate  could 
correspond  to  a  local  minimum  In  the  objective  function. 

Although  the  mode  of  stagnation  just  presented  Is  conjectured,  It 
provides  the  motivation  for  the  "shrunken-mask"  algorithm.  The 
shrunken-mask  algorithm  Is  designed  to  find  the  proper  registration 
early  on  in  the  Iterative  reconstruction  thus  circumventing  shift- 
related  stagnation  that  might  otherwise  appear. 

Consider  a  new  binary  mask  mt(x,y)  created  by  hard! 1ml ting  the 
tapered  Illumination  function  with  some  Intermediate  threshold  value: 

’1,  (x,y)  such  that  lw(x,y)l  >  t 

"»t(x,y)  »  ■  (5-9) 

.0,  (x,y)  such  that  lw(x,y)l  $  t 


where  t  Is  the  threshold  value,  0  £  til.  Notice  that  mt(x,y)  will  be 
a  "shrunken''  version  of  the  full  mask  m(x,y)  defined  for  t  -  u.  Suppose 
that  we  employ  the  shrunken  mask  as  the  support  constraint.  If  we  crop 
the  true  object  with  the  shrunken  mask  this  will  yield  an  estimate  with 
a  modest  penalty  in  both  the  object  and  Fourier  domains,  so  long  as  the 
threshold  value  Is  not  too  large.  Notice,  however,  that  a  shift  In  this 
cropped  estimate  will  yield  an  object-domain  penalty,  due  to  the 
shrunken  mask  and  the  artificially  created  discontinuous  object  edges, 
that  is  much  greater  than  the  penalty  that  would  be  due  to  the  normal 
support  constraint.  Thus  we  would  expect  the  output  Image  to  be 
centered  better  with  the  shrunken  mask. 

While  the  Fourier  modulus  and  the  shrunken-mask  support  constraints 
are  Inconsistent,  they  may  still  be  Jointly  enforced  In  an  Iterative 
reconstruction  algorithm  to  get  an  Intermediate  reconstruction.  We 
might  expect  this  Intermediate  result  to  display  gross  features  of  the 
true  object  In  proper  registration.  Enlarging  the  mask  to  its  full  size 
(setting  t  ■  0)  removes  the  constraint  inconsistency  and  allows  for  a 
complete  reconstruction  that  hopefully  avoids  shift-related  local 
minima.  The  shrunken-mask  algorithm  is  shown  schematically  In  Figure  5- 
9. 


The  shrunken-mask  algorithm  was  first  tested  on  the  elliptical 
object  where  the  taper  (taper  #2  in  Figure  5-3)  Induced  stagnation  In 
previous  trials.  The  convergence  characteristics  are  displayed  In 
Figure  5-10.  It  Is  clear  that  the  conventional  algorithm  performed 
better  early  on  In  the  Iterative  sequence.  This  Is  reasonable  since  the 
support  constraint  Is  Initially  looser  and  easier  to  satisfy.  By 
contrast,  the  shrunken-mask  algorithm  error  metric  quickly  levels  off 
while  an  Intermediate  reconstruction  Is  being  produced  but  drops 
dramatically  when  the  full-size  mask  is  Introduced. 


79 


True  objict 


Intermediate  result 


Final  reconstruction 


FIGURE  5-9.  THE  SHRUNKEN-MASK  ALGORITHM 
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FIGURE  MO.  CONVERGENCE  FOR  SHRUNKEN-HASK  ALGORITHM.  Taper  It  Taper 
H  In  Figure  5-J.  The  shrunken  mask  had  a  threshold  value  t  •  .9. 


5.4  THE  ENLARGING  MASK  ALGORITHM 


While  the  success  In  the  shrunken-mask  algorithm  Is  encouraging,  the 
amount  of  Illumination  taper  for  which  It  worked  remains  extremely 
small.  A  much  more  substantial  taper  was  Introduced  by  using  a  circular 
convolution  kernel  with  a  radius  of  4  pixels.  Tho  resultant 
Illumination  pattern  Is  shown  In  Figure  5-11.  When  the  shrunken-mask 
algorithm  was  applied  to  an  object  with  this  Illumination,  the 
convergence  was  not  much  better  than  the  conventional  algorithm.  This 
was  true  for  a  variety  of  threshold  values  that  were  tested.  Apparently 
the  Increased  taper  Is  a  significant  obstacle  for  the  shrunken-mask 
algorithm. 


Recall  that  the  shrunken-mask  algorithm  jumps  from  a  small  mask  to 
the  full  mask  In  a  single  step.  A  logical  generalization  of  the 
shrunken-mask  algorithm  uses  several  intermediate-size  masks  In  order  to 
make  a  more  gradual  transition  to  the  full  size  mask.  We  call  this  the 
"enlarglng-mask"  algorithm.  The  collection  of  masks  used  In  a  given 
application  Is  characterized  by  a  sequence  of  threshold  values.  The 
convergence  curve  for  the  enlarglng-mask  algorithm,  when  applied  to  an 
object  with  this  Increased  taper,  Is  shown  In  Figure  5- 1 2(a) .  The 
scallop  effect  exhibited  by  the  convergence  curve  Is  due  to  the 
successive  application  of  Increasingly  enlarged  masks.  The  enlarglng- 
mask  algorithm  clearly  out-performs  the  shrunken-mask  algorithm  and  the 
final  reconstruction  exhibits  very  good  agreement  with  the  data  and 
support  constraint. 

A  final  trial  was  performed  with  an  even  more  realistic  Illumination 
taper  created  with  a  Gausslan-llke  convolution  kernel  with  a  maximum 
radius  of  6  pixels.  This  kernel,  K(r),  was  formed  by  correlating  a 
circle  function  with  a  radius  of  2  pixels  with  Its  own  autocorrelation: 

K(r)  •  CIRC  (r/2)  **  CIRC  (r/2)  **  CIRC  (r/2),  (5-10) 
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FIGURE  5-12.  COMPARISON  OF  CONVERGENCE  BEHAVIOR  OF  THREE  ALGORITHMS. 
For  Illumination  taper  due  to  (a)  a  circular  convolution  kernel  of 
radius  4  pixels,  and  (b)  a  Gaussian-1 ike  convolution  kernel  of 
maximum  radius  6  pixels.  The  threshold  sequence  for  the  enlarging- 
mask  algorithm  was  0.5,  0.3,  0.2,  0.1,  0.001. 


.  .J.V 


•  '  '  f  ',1  »i  •ll  ’il  M  *'l'*  1  *  ft  I  '*  l'T  #**  I  J  t  «'  !  ,’t 

►  i-  - 1  -»-*  •  ki.i  -  4*1  ^»l  ».k  ami  «.A  «.4  i-  t  ».4  V.  4  m-l  v  >  A  *4  .  4  *4 


where  the  double  star  Indicates  two-dimensional  cross-correlation.  This 
kernel  Is  a  close  approximation  to  a  two-dimensional  Gaussian  function. 
The  resultant  Illumination  pattern  Is  shown  In  Figure  5-11.  Note  that 
this  Illumination  has  a  smoother  taper  and  that  the  tails  extend  out 
further  at  very  low  levels.  The  convergence  curves  for  this  case  are 
shown  In  Figure  5-12(b).  Again  the  enlarglng-mask  algorithm  succeeds  at 
finding  a  reconstruction  that  Is  In  excellent  agreement  with  the  data 
and  support  constraint  whereas  the  conventional  algorithm  did  not.  This 
reconstruction  Is  visibly  Indistinguishable  from  the  true  object.  The 
results  of  reconstructions  performed  with  and  without  the  enlarglng-mask 
algorithm  are  given  In  Figure  5-13  for  Illumination  patterns  due  to  the 
circular  and  Gaussian  convolution  kernels. 

While  tapered  Illumination  presents  significant  stagnation  problems 
for  conventional  phase-retrieval  algorithms,  these  examples  demonstrate 
that  the  enlarglng-mask  algorithm  successfully  circumvents  these 
difficulties,  even  In  the  presence  of  large  amounts  of  taper. 

The  success  of  the  enlarglng-mask  algorithm  leads  us  to  ask  how  much 
taper  can  be  Introduced  before  the  performance  of  the  algorithm 
deteriorates.  In  order  to  Investigate  this  question,  a  new  series  of 
simulations  was  performed  using  a  frame  size  of  256  x  256  pixels.  The 
source  of  the  Illumination  pattern  was  changed  from  the  pair  of  ellipses 
to  an  Isosceles  triangle  with  50  pixels  on  a  side.  These  choices  were 
made  for  multiple  reasons.  In  the  first  place,  this  simulation  design 
allows  for  the  Introduction  of  extreme  amounts  of  taper  without  aliasing 
problems.  Secondly,  the  disjoint  elliptical  patterns  were  dropped  since 
a  single  connected  Illumination  pattern  results  when  enough  taper  Is 
added.  Because  of  this,  It  would  be  difficult  to  determine  If  a 
decrease  In  performance  were  Jue  to  Increased  taper,  the  breakdown  In 
the  disjoint  nature  of  the  Illumination  pattern,  or  a  combination  of  the 
two.  The  use  of  a  single  connected  Illumination  pattern  helps  to 
Isolate  the  effects  due  to  taper  alone.  Finally,  the  triangular  shape 
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FIGURE  5-13.  RECONSTRUCTIONS  WITH  AND  WITHOUT  THE  ENLARGING-MASK 
ALGORITHM  (EMA).  The  Illumination  pattern  In  A-C  Is  shown  In  Figure 
5-11B.  The  illumination  pattern  in  D-F  is  shown  in  Figure  5-UC. 


was  selected  because  It  provides  the  best  known  support  constraint  for 
phase  retrieval  (see  Section  4.1). 


In  a  real  system,  taper  will  unavoidably  be  introduced  by  convolving 
a  (possibly  binary)  source  Illumination  pattern,  b(x),  with  a  system 
Impulse  response  ( IPR)  function,  a(x): 

w(x)  ■  b(x)  *  a(x)  ,  (5-11) 

where  w(x)  Is  the  coherent  Illumination  pattern,  as  before.  In  an 
optical  system  the  IPR  Is  the  Fourier  transform  of  the  coherent  transfer 
function  or  aperture  function,  A(u).  The  casting  of  an  Illumination 
pattern  by  a  coherent  optical  system  Is  Illustrated  In  Figure  5-14. 
Similar  concepts  apply  to  radar  systems. 

In  order  to  minimize  the  amount  of  energy  that  Is  diffracted  to 
regions  far  afield  of  the  desired  binary  Illumination  pattern,  the  side 
lobes  of  the  IPR  should  be  reduced.  This  Is  done  conventionally  with  an 
apodlslng  aperture.  The  form  of  the  particular  apodlzatlon  used  In 
these  simulations  Is  given  here: 


A(u) 


P  *  Pc 
P  >  Pc 


(5-12) 


where  p  *  lul  and  pc  represents  the  cutoff  frequency.  Figure  5-15  shows 
a  cross  section  of  such  an  apodlslng  aperture  and  the  apodlsed  IPR  that 
results  from  It.  This  particular  apodlslng  function  Is  one  of  many  used 
conventionally  [5.4]  although  It  Is  not  necessarily  optimal  for  the 
enlarglng-mask  algorithm. 
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FIGURE  5-14.  FORMATION  OF  TAPERED  ILLUMINATION  PATTERN 
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FIGURE  5-15.  APGDISING  APERTURE  AND  IMPULSE  RESPONSE  FUNCTION 

A.  Cross  section  of  circularly  s>ranietric  apodising  function. 

B.  Intensity  plot  of  resulting  impulse  response  function. 


Four  Illumination  patterns  were  produced,  each  with  a  different 
amount  of  taper.  The  amount  of  taper  Introduced  was  controlled  by 
varying  the  value  of  the  cutoff  frequency  In  Eq.  (5-12).  The  four 
Illumination  patterns  are  shown  In  Figure  5-15.  These  Illumination 
patterns  were  applied  to  actual  SAR  data  of  an  extended  cultural  scene 
to  get  four  effective  objects.  The  Fourier  Intensity  for  each  effective 
object  represents  the  raw  data  in  this  simulation.  The  Fourier  modulus 
for  each  effective  object  Is  shown  In  Figure  5-17.  A  close  examination 
of  Figure  5-17  reveals  that  the  Fourier  modulus  does,  In  fact,  change 
with  varying  amounts  of  taper.  It  Is  difficult  to  see  any  specific 
trends  In  the  Fourier  modulus  that  Is  Indicative  of  the  amount  of  taper 
present. 

A  standardized  enlarglng-mask  algorithm  was  exercised  with  each  data 
set.  The  details  of  this  algorithm  are  shown  In  Table  5-1.  While  the 
choice  of  number  of  Iterations  at  each  threshold  Is  probably  reasonably 
good,  we  do  not  claim  that  It  Is  optimum. 

Table  5-1 

STANDARDIZED  ENLARGING-MASK  ALGORITHM 


Threshold 

(%  of  Illumination  Peak) 

90 

70 

50 

30 

10 

1 


Number  of  Iterations 

150 

200 

200 

200 

700 

400 


The  effective  or  true  objects  with  varying  amounts  of  taper  and  their 
associated  enlarglng-mask  reconstructions  are  displayed  In  Figure  5-18. 
These  reconstructions  Indicate  that  for  very  largo  amounts  of  taper  (12- 
16  pixels)  even  the  enlarglng-mask  algorithm  Is  unable  to  produce  a 
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i-16.  ILLUMINATION  PATTERNS  WITH  VARYING  AMOUNTS  OF  TAPER 
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FIGURE  5-17.  FOURIER  MODULUS  FOR  OBJECTS  WITH  TAPERED  ILLUMINATION. 
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high-fidelity  reconstruction.  Nevertheless,  low-frequency  Information 
Is  restored.  An  overexposed  version  of  the  Images  In  Figure  5-18  Is 
given  In  Figure  5-19.  This  Illustrates  that  the  true  object  with  16 
pixels  of  taper  really  does  have  energy  at  the  extremes  of  the  tapered 
Illumination.  In  addition,  It  Is  clear  that  the  enlarglng-mask 
algorithm  tries  to  reconstruct  too  much  energy  In  the  taper  region. 

It  Is  useful  to  define  a  mean-square  error  figure  of  merit  between 
the  reconstruction  and  the  true  object.  Of  course  such  a  figure  of 
merit  would  not  be  available  In  a  true  Imaging  application  because  the 
true  object  would  not  be  available  for  the  computation.  Nevertheless, 
this  figure  o'  merit  Is  very  useful  In  characterizing  the  performance  of 
the  algorithm  in  controlled  settings  such  as  computer  simulations.  The 
normalized  absolute  error  Is  defined: 

E  1 ag(x-xo)  -  f (x) 1 2 

E  lf(x)|2 

x 

wnere  f(x)  Is  the  true  object  and  g (x)  Is  the  reconstruction.  Because 
phase  retrieval  may  be  unable  to  provide  absolute  registration  of  the 
reconstruction,  g(x)  Is  optimally  shifted  by  the  vector  x0  prior  to  the 
computation  of  Eg.  Furthermore,  the  reconstruction  will  always  have  a 
constant  phase  ambiguity.  Therefore  the  complex  coefficient  a  Is  also 
selected  to  minimize  Eg.  The  normalized  absolute  error  Is  plotted  as  a 
function  of  number  of  pixels  of  ;aper  In  Figure  5-20. 

5.5  The  Effects  of  Noise  and  Taper 

In  order  to  provide  guidance  for  the  design  of  a  real  system  one 
needs  to  know  what  the  combined  effect  of  tapered  Illumination  and  noisy 
data  Is  on  the  quality  of  reconstruction.  The  first  experiment 
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performed  to  explore  this  Issue  utilized  an  untapered  triangular 
Illumination  pattern,  and  varying  amounts  of  zero-mean  Gaussian  noise 
were  added  to  the  Fourier  Intensity.  Any  negative  values  that  resulted 
were  set  to  zero,  giving  an  estimated  Fourier  Intensity 


IF(u)l2  - 


'I  F(u)  1 2  +  n(u) 
.  0 


I F(u) 1 2  +  n(u)  >  0 
otherwise 


(5-14) 


where  n(u)  Is  one  realization  of  the  random  noise  process  with  a 
standard  deviation  of  a n.  The  amount  of  noise  Is  quantified  by  percent 
noise,  defined  here  as  the  ratio  of  the  standard  deviation  of  the  noise 
to  the  mean  value  of  the  Fourier  Intensity 


%  Noise  -  a"1  ^  22  * F(u)  1 2 

-  y2  ,  (5-15) 

where  N  Is  the  number  of  pixels  In  the  Fourier  Intensity  and  I  is  the 
mean  Fourier  Intensity.  Under  the  assumption  that  the  noiseless  Fourier 
Intensity  Is  a  fully-developed  speckle  pattern,  the  Intensity  will  have 
negative-exponential  statistics.  As  a  result  the  standard  deviation  In 
the  Intensity,  a j,  Is  equal  to  the  mean  Intensity  and  Eq.  (5-15)  can  be 
rewritten 


or 


%  Noise 


-A?  =  (%  noise)2. 

°1 


(5-16) 
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The  ratio  of  the  variance  of  the  noise  to  that  of  the  signal,  as 
expressed  In  Eq.  (5-16),  Is  an  alternative  way  of  quantifying  the  amount 
of  noise  present. 

An  estimate  of  the  Fourier  modulus  is  produced  by  taking  the  square 
root  of  the  estimated  Fourier  Intensity.  The  estimated  Fourier  modulus 
Is  used  to  enforce  the  Fourier  constraint  In  the  Iterative  phase- 
retrieval  algorithm.  A  one-dimensional  cross  section  of  the  estimated 
Fourier  modulus  Is  given  In  Figure  5-21  for  the  cases  of  no  noise  and 
10%  noise.  A  small  area  of  the  estimated  Fourier  modulus  Is  presented 
In  Figure  5-22  with  varying  amount  of  noise.  The  effect  of  the  noise  Is 
more  pronounced  near  the  nulls  In  the  Fourier  modulus,  as  expected. 

The  Iterative  phase  retrieval  algorithm  was  exercised  using  the 
triangular  support  and  the  estimated  Fourier  modulus  with  varying 
amounts  of  noise.  The  number  of  Iterations  and  the  ordering  of  the 
hybrid  Input-output  and  the  error  reduction  Iterations  were  the  same 
that  were  used  In  the  earlier  enlarging  mask  experiments  (Table  5-1). 
Figure  5-23  presents  reconstructions  for  the  various  noise  levels  and 
the  true  object  Is  given  for  comparison.  The  quality  of  the 
reconstructions  Is  excellent  up  to  the  10%  noise  level.  At  noise  levels 
of  15%  and  higher  the  reconstructions  are  noticeably  degraded. 
Nevertheless  the  gross  features  of  the  object  persist  even  at  the  30% 
noise  level.  These  results  confirm  our  view  that  the  phase-retrieval 
problem  Is  not  Ill-conditioned,  or  hypersensitive  to  noise.  Rather,  as 
moderate  levels  of  noise  are  added  In  Increasing  amounts  to  the  raw 
data,  the  respective  reconstructions  degrade  gracefully. 

The  Fourler-domaln  error  metric,  Eq.  (5-8),  associated  with  the 
final  reconstruction  Is  plotted  as  a  function  of  %  noise  In  Figure  5-24. 
Five  reconstruction  sequences  were  performed  for  the  case  of  5%  noise, 
each  with  a  different  noise  realization.  It  Is  unclear  why  these 


Normalized  Fourier  Modulus 


F-IGURE  5-22.  FOURIER  MODULUS  WITH  VARYING  AMOUNT S  OF  ADDITIVE  NOISE. 
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FIGURE  5-23.  RECONSTRUCTIONS  FOR  OBJECTS  WITH  UNTAPERED  ILLUMINATION 
AND  VARYING  AMOUNTS  OF  NOISE. 


Fourler-doinaln  error  metrics  tend  to  be  larger  than  the  trend  predicted 
by  the  other  noise  levels. 

It  Is  also  Interesting  to  compute  a  normalized  absolute  error  In  the 
Fourier  modulus: 

£  [IF(u)l  -  I F (u) ] I 
jj _ 

£  IF(u)|2 
u 

This  figure  of  merit  serves  as  yet  another  metric  for  quantifying  the 
amount  of  noise  In  the  data.  In  addition  It  represents  an  upper  bound 
on  the  achievable  Fourler-domaln  error  metric.  Consider  substituting 
the  true  object  as  the  latest  estimate  achieved  by  the  Iterative 
algorithm. 

If  this  were  done,  then 

I G (u) I  -  I F(u)  I  ,  (5-18) 

and  it  Is  easy  to  show  that 

Ef  ■  E|FI  •  (5-19) 

If  we  continue  to  Iterate  with  error  reduction  we  are  guaranteed  that  Ep 
will  not  Increase  [5.3]  and  will  probably  decrease.  The  Implication  Is 
that  solutions  exist  for  which  the  Fourler-domaln  error  metric  Is  at 
least  equal  to  and  probably  less  than  E | p I •  Since  the  Fourler-domaln 
error  metrics  don't  achieve  this  upper  bound,  as  depicted  In  Figure  5- 
24,  the  reconstruction  quality  Is  not  yet  limited  solely  by  the  noise. 
This  comparison  is  a  statement  about  algorithm  efficiency,  or  the 
ability  of  the  algorithm  to  achieve  the  upper  bound.  We  believe  that 
Improvements  to  the  algorithm  may  yet  produce  noise-limited 
reconstructions. 


These  experiments  have  examined  the  effect  of  noise  In  the  absence 
of  taper.  They  serve  as  a  reference  with  which  to  compare  experiments 
In  which  taper  Is  present.  A  moderate  amount  of  taper  (6  pixels  or  pc  * 
43  pixels)  was  selected  and  various  amounts  of  noise  were  added  to  the 
Fourier  Intensity.  The  reconstruction  sequence  and  the  number  of 
Iterations  was  the  same  that  has  been  used  throughout  the  experiments 
with  triangular  Illumination  patterns  (Table  5-1).  The  reconstructions 
with  and  without  taper  are  shown  In  Figure  5-25.  Clearly  the 
reconstruction  quality  degrades  much  more  quickly  with  increased  noise 
In  the  presence  of  taper.  Roughly  speaking,  the  reconstruction  with  IX 
noise  and  taper  Is  comparable  in  quality  to  the  reconstruction  with  10% 
noise  and  no  taper.  Figure  5-26  is  a  plot  of  the  absolute  error  In  the 
reconstruction,  Eg,  as  a  function  of  the  absolute  error  In  the  Fourier 
magnitude,  E|fi,  for  both  tapered  and  untapered  objects.  Evidently  the 
Inclusion  of  taper  significantly  hampers  the  algorithm's  ability  to 
reconstruct  In  the  presence  of  noise.  Figure  5-27  Is  a  plot  of  Ep  and 
E | p |  as  a  function  of  %  noise,  A  careful  comparison  of  Figures  5-27  and 
5-24  shows  that  the  algorithm  efficiency,  or  ability  to  achieve  the  E| p | 
upper  bound,  Is  reduced  when  taper  Is  present.  In  other  words,  there  Is 
even  more  room  for  algorithmic  Improvement  when  taper  Is  present. 
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FIGURE  5-25.  RECONSTRUCTIONS  FROM  DATA  WITH  VARYING  AMOUNTS  OF  NOISE 
FOR  UNTAPERED  AND  TAPERED  (6  PIXELS)  ILLUMINATION. 
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FIGURE  5-27.  ALGORITHM  EFFICIENCY  FOR  TAPERED  ILLUMINATION 
(6  PIXELS  TAPER). 
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SECTION  6 

GRADIENT-SEARCH  METHODS  IN  PHASE  RETRIEVAL 
6.1  INTRODUCTION 

Researchers  have  explored  many  approaches  to  solving  the  phase 
retrieval  problem.  These  Include  direct  methods  using  complex  zeros  In 
the  analytically  extended  Fourier  modulus  [6.1],  the  error-reduction 
algorithm  [6.2,  6.3],  Input-output  algorithms  [6.3],  recursive 

algorithms  [6.4,  6.5],  and  gradient-search  algorithms  [6.3,  6.6,  6.7, 
6.8].  Of  these  approaches  the  Input-output  (HIO)  algorithm  appears  to 
be  the  current  algorithm  of  choice  when  operating  on  2-dlmenslonal  data. 
The  HIO  algorithm  has  consistently  outperformed  competing  algorithms 
with  respect  to  computational  burden  and  robustness  to  noise.  In  spite 
of  the  relative  success  of  the  Input-output  algorithms  there  are 
documented  Instances  in  which  such  an  algorithm  converges  extremely 
slowly  or  even  stagnates  In  its  convergence  [6.9]. 

In  this  report  we  are  Interested  In  the  specific  phase-retrieval 
problem  for  which  the  Fourier  modulus  and  an  object  support  constraint 
are  known.  We  resurrect  the  Idea  of  employing  a  gradient-search  method 
In  the  hopes  of  developing  an  algorithm  that  will  compete  well  with  or 
complement  the  Input-output  approach.  Gradient-search  approaches 
require  the  determination  of  an  object  function  that  Indicates  the 
degree  of  consistency  with  the  data  and  the  constraints.  This  choice  Is 
pivotal  In  designing  a  specific  gradient- search  algorithm.  We  propose 
here  three  distinct  objective  functions  and  explore  the  performance  of 
each  when  used  In  conjunction  with  standard  gradient-search  techniques. 
In  the  next  section  we  discuss  the  error-reduction  algorithm,  the  parent 
of  the  Input-output  algorithms  and  Indicate  how  It  can  be  Interpreted  as 
a  gradient-search  algorithm.  We  Introduce  the  first  new  objective 
function,  called  the  summed  objective  function,  In  Section  6.3.  The 
second  and  third  objective  functions  are  Introduced  In  Sections  6.4  and 
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6.5.  These  objective  functions  utilize  the  same  object-support  error 
metric  but  differ  In  their  underlying  parameters.  Preliminary  results 
with  the  new  objective  functions  are  described  In  Section  6.6.  We 
conclude  In  Section  6.7  with  projections  of  future  work. 

6.2  THE  ERROR-REDUCTION  ALGORITHM 

An  Iterative  algorithm  that  has  enjoyed  much  success  In  phase 
retrieval  Is  known  as  the  error-reduction  (ER)  algorithm,  which  may  be 
easily  understood  by  referring  to  Figure  6-1.  This  algorithm  consists 
of  transforming  between  object  and  Fourier  domains  and  applying 
appropriate  constraints  in  the  respective  domains.  We  use  the  symbol 
gk(x)  to  represent  the  estimate  of  an  object  given  by  the  kth  Iteration 
of  the  ER  algorithm.  The  prime  notation  In  gk * (x)  Indicates  a  version 
of  the  kth  estimate  for  which  the  Fourler-domaln  constraints  have  been 
enforced.  We  use  uppercase  symbols  to  denote  a  Fourler-domaln 
representation  of  a  function.  In  practice  the  data  are  always  sampled 
and  therefore  we  use  the  discrete  Fourier  transform  (DFT) 

G(u)  •  E  g(x)e-,2lru*x/N  (6-1) 

X 

and  Its  Inverse 

g(x)  -  IT2  E  G(u)e12,u'x/N 
u 

In  the  algorithm.  Of  course  the  DFT  Is  most  efficiently  computed  with  a 
fast  Fourier  transform  ( F FT) .  In  Eqs.  (6-1)  and  (6-2)  x  and  u  are  two- 
dimensional  vectors  In  the  object  and  Fourier  domains,  respectively,  and 
the  summation  notation  Is  understood  to  represent  a  separate  summation 
for  each  component  of  the  vector  running  from  0  to  N-l. 


Apply  Object  Domain 
Constraint: 


wx)  ■ 


gk'(x),  x  e  S 
0  ,  x  e  S' 


Apply  Fourier  Modulus 
Constraint: 

,  ,  8k(u|F(u)| 

V<“>  ‘  \tuTT 


In  order  to  monitor  the  progress  of  the  ER  algorithm  It  Is  useful  to 
define  an  error  metric  for  each  of  the  constraints.  The  error  metric  Is 
essentially  a  mean  squared  error  between  estimates  before  and  after  a 
constraint  has  been  applied  and  Indicates  the  degree  of  agreement 
between  the  latest  estimate  and  the  known  constraint.  The  error  metric 
for  the  Fourier  modulus  constraint  Is  defined  as  follows: 

eF2  -  N“2  H  [!G(u)l  -  I F(u) I]2  .  (6-5) 

r  u 

The  error  metric  for  the  object-support  constraint  Is  given  by 

e02  ■  £  lfl'(x)l2  .  (6-6) 

0  xiS1 

As  the  algorithm  proceeds  both  of  these  error  metrics  will  decrease.  If 
they  simultaneously  achieve  values  close  to  or  equal  to  zero  then  the 
algorithm  has  achieved  a  restoration  that  has  good  agreement  with  both 
constraints. 

Suppose  that  we  treat  the  error  metric  ep2  as  an  objective  function 
to  be  used  In  a  gradient-search  algorithm.  Our  desire  Is  to  minimize 
the  objective  function  by  varying  a  set  of  parameters  In  the  estimate. 
The  parameters  we  employ  are  the  individual  pixel  values  of  the 
estimate.  For  the  present  we  treat  only  real-valued  objects  which 
require  N2  independent  parameters  for  an  NxN  image  (complex  objects 
require  2N2  parameters).  The  jth  pixel  In  the  object  domain  Is  located 
by  a  vector  xj  where  the  subscript  j  represents  any  convenient  ordering 
of  the  N2  pixels.  We  construct  an  N2-dimensional  Euclidian  vector  space 
for  which  each  coordinate  axis  corresponds  to  an  individual  parameter. 
Each  point  in  this  parameter  space  therefore  corresponds  to  an  object 
estimate  and  may  be  represented  by  the  parameter  vector  g(x).  We 
represent  the  J*h  parameter  and  Its  associated  parameter  space  unit 
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vector  by  g (xj )  and  vj,  respectively.  The  unit  vector  vj  may  be 
Interpreted  as  an  estimate  for  which  all  pixels  are  zero  except  for  the 
jth  pixel  which  has  unit  strength.  This  vector  may  also  be  represented 
by  the  Kronecker  delta  5X(Xj.  The  objective  function,  ep2(g(x)),  Is  a 
function  of  the  N2  parameters,  and  may  be  visualized  as  a  surface  In  an 
N2+l-d1mens1onal  space.  If  we  were  able  to  calculate  the  gradient  of 
thlr  surface  at  given  estimate  locations  then  well-known  gradient-search 
methods  could  be  employed.  The  gradient  is  formally  expressed 
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(6-7) 


One  method  of  computing  the  gradient  Is  to  proceed  numerically  using  a 
finite  differences  approximation  to  the  partial  derivatives 
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eg(xj) 


eF2(g(x)  +  gyp  -  ep2(g(x) ) 


(6-8) 


where  a  Is  small  compared  with  significant  feature  sizes  In  the 
objective  surface.  This  brute-force  approach  Is  computationally 
prohibitive  since  each  evaluation  of  ep2  Involves  an  NxN  FFT  and  this 
must  be  accomplished  for  each  of  the  N2  parameters.  Fortunately  Flenup 
[6.3]  showed  that  the  exact  partial  derivative  may  be  calculated 
analytically  as  follows: 
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(6-9) 


We  reemphasize  that  the  prime  Indicates  that  the  Fourier  magnitude 
constraint  has  been  applied  to  the  estimate.  If  Eq,  (6-9)  is 
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substituted  Into 
may  be  evaluated 


This  desirable 
realistically  be 

Perhaps  the 
steepest  descent 
may  be  Improved 
opposite  that  of 

objective  function  along  the  resulting  one-dimensional  cut  Is  then 
determined  giving  an  Improved  estimate.  This  procedure  Is  repeated 
Iteratively  until  a  local  minimum  In  the  objective  function  Is  achieved. 

Some  optimization  problems  afford  additional  a  priori  Information 
about  disallowed  regions  In  parameter  space.  There  are  many  ways  of 
constraining  the  final  solution  to  the  allowed  region  of  parameter 
space.  One  obvious  way  of  Incorporating  this  Information  Is  to  proceed 
as  usual  with  the  steepest-descent  algorithm  until  an  estimate  Is 
produced  that  violates  the  a  priori  knowledge.  A  constraint  operator  is 
then  employed  to  find  the  closest  allowed  estimate.  The  steepest- 
descent  algorithm  Is  then  applied  to  the  latest  allowed  estimate. 
Unfortunately  this  constrained  steepest-descent  algorithm  can  be  very 
slow  since  the  direction  of  steepest  descent  Is  oiten  In  competition 
with  the  direction  enforced  by  the  constraint  operator. 

A  careful  analysis  of  the  ER  algorithm  reveals  that  It  Is,  In  fact, 
a  constrained  steepest-descent  algorithm  for  which  the  objective 
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Eq.  (6-7)  the  result  Implies  that  the  entire  gradient 
with  a  forward  and  an  inverse  FFT : 


Vep (g (x) )  -  ^  2[g(xj)  -  9 ' (xj)] vj 

■  2[g(x)  -  g'(x)] 


(6-10) 


result  means  that  a  gradient  search  method  could 
employed  for  the  ep2(g(x))  objective  function. 

simplest  gradient-search  algorithm  Is  the  method  of 
[6.10].  According  to  this  approach  the  latest  estimate 
upon  by  moving  In  parameter  space  In  a  direction 
the  gradient.  The  location  of  the  minimum  of  the 
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function  Is  ep2(g(x))  and  the  knowledge  of  object  support  defines  a 
disallowed  region  In  parameter  space.  The  kth  Iteration  of  the  F.R 
algorithm  begins  with  an  estimate,  (x) t  and  replaces  Its  Fourier 
modulus  with  the  known  Fourier  modulus  to  get  gk'(x).  Notice  that  this 
Intermediate  result  Is  equivalent  to  moving  from  g|< (x)  In  parameter 
space  In  a  gk'(x)  -  gi<(x)  direction;  that  Is,  In  a  direction  opposite  to 
that  of  the  gradient.  In  fact  It  can  be  shown  that  the  objective 
function  Is  a  minimum  (zero)  at  gk'(x).  Typically  gk'(x)  will  violate 
the  known  support  and  therefore  exists  In  a  disallowed  region  In 
parameter  space.  Applying  the  support  constraint  to  gk'(x)  produces  a 
new  estimate,  g^i*  that  now  resides  In  the  allowed  region,  thus 
completing  one  Iteration  of  the  constrained  steepest-descent  algorithm. 

While  we  have  thus  far  treated  the  Fourler-domaln  error  metric  as  an 
objective  function  we  could  just  as  easily  have  selected  the  object- 
domain  error  metric,  e02(g'  (x)),  for  that  role.  The  gradient  for  this 
objective  function  Is  easily  obtained  because  the  calculation  of  the 
partial  derivative  with  respect  to  a  pixel  value  Is  more  direct: 


(6—11) 

f°  v$ 

.2g'(Xj)  Xj  e  S' 

Recall  that  the  support  constraint  operator  sets  to  2ero  all  pixels  in 
S'  and  leaves  those  In  S  untouched.  Clearly,  this  operation  moves  the 
latest  estimate  gk'(x)  in  a  direction  opposite  that  of  Ve02(g  ' (x) ) .  In 
addition  this  objective  function  is  quadratic  along  this  one-dimensional 
cut  with  a  minimum  value  (zero)  at  gk+l(x).  The  Fourier  modulus 
constraint  may  now  be  Interpreted  as  the  operator  that  takes  gk+1 (x)  out 
of  a  new  disallowed  region  In  parameter  space.  Thus  the  ER  algorithm 


qualifies  as  a  constrained  steepest-descent  algorithm  from  this  new 
perspective  as  well. 

6.3  THE  SUMMED  OBJECTIVE  FUNCTION 

Historically  the  error  metric  e02  has  been  used  to  evaluate  an 
estimate  for  which  the  Fourier  constraints  have  been  satisfied. 
Consequently,  this  error  metric  Is  a  function  of  the  pixel  values  In  a 
primed  estimate,  as  defined  In  Eq.  (6-6).  A  simple  generalization  of 
this  definition  yields  as  a  new  error  metric  that  can  be  applied  to  any 
estimate  g(x): 

«2(fl(x))  -  Z  CflU)]2  (6-12) 

xes 1 

It  Is  easy  to  show  that  the  partial  derivative  of  e02  with  respect  to 
pixel  values  In  the  estimate  has  the  same  form  as  given  In  Eq.  (6-11). 
Clearly  this  generalized  objective  function  and  Its  gradient  still 
pertain  to  functions  for  which  the  Fourier  constraints  have  been 
satisfied.  Notice,  however,  that  e02(g(x))  now  has  the  same  underlying 
parameters  as  ep2(g(x)).  This  observation  affords  still  a  third 
Interpretation  of  the  ER  algorithm  that  yields  new  Insight.  The  ER 
algorithm  may  be  viewed  as  alternately  performing  steepest-descent 
operations  on  two  objective  functions,  ep2(g(x))  and  £o2(fl(x))»  that 
coexist  In  the  same  parameter  space.  In  practice  It  Is  often  observed 
that  the  ER  algorithm  converges  rapidly  for  Iterations  early  In  the 
sequence  but  that  convergence  becomes  painfully  slow  as  the  Iteration 
number  Increases.  This  Is  because  the  work  performed  In  minimizing  the 
eF2(g(x))  objective  function  Is  largely  nullified  when  minimizing  the 
e02(g(x))  objective  function,  and  vice  versa.  Figure  6-3a  Illustrates 
this  point  pi ctorl ally.  This  viewpoint  suggests  the  definition  of  a  new 
objective  function  that  is  the  sum  of  the  opposing  objective  functions: 
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(6-13) 


es2(0(x))  ■  fip2(o(x) )  +  «„2(g(x)) 

The  gradient  of  this  new  objective  function  Is  simply  the  sum  of  the 
gradients  already  derived: 

»«,2(g(x»  ‘  7«F2(g(x»  *  Ve02(o(x>)  .  (6-14) 

The  calculation  of  this  new  gradient  Involves  a  forward  and  Inverse  FFT 
and  a  small  amount  of  computational  overhead.  Figure  6-3b  suggests  how 
moving  In  a  direction  opposite  of  the  gradient  of  the  summed  objective 
function  may  circumvent  stagnation  due  to  opposing  constraints.  Notice 
that  If  we  choose  to  remain  with  steepest  descent  using  es2,  the 
stepslze  still  has  to  be  determined.  This  can  be  accomplished  by  one  of 
a  variety  of  line  search  methods  that  utilize  additional  samples  of  the 
objective  function.  Each  additional  objective-function  evaluation 
requires  a  single  forward  FFT.  Furthermore,  because  the  gradient  of  the 
summed  objective  function  Is  so  easily  computed  more  sophisticated 
gradient-search  methods  such  as  the  method  of  conjugate  gradients  or  a 
memoryless  quasi -Newton  method  [6.10]  may  profitably  be  employed. 
Finally,  a  simple  generalization  of  these  Ideas  to  Include  complex 
objects  Is  found  In  Appendix  E. 

6.4  THE  e02(g(x))  OBJECTIVE  FUNCTION 

We  now  briefly  review  the  basic  characteristics  of  the  so-called 
Input-output  phase-retrieval  algorithms,  These  observations  will 
suggest  the  defining  of  a  new  objective  function  that  will  serve  as  an 
alternative  to  the  summed  objective  function. 


It  Is  convenient  to  partition  an  Iteration  of  the  ER  algorithm  Into 
two  steps.  The  first  step  enforces  the  Fourler-domaln  constraints  while 
the  second  step  enforces  the  object-domain  constraints.  For  the  moment 


we  focus  on  the  first,  step.  This  step  Involves  a  Fourier  transformation 
of  the  latest  estimate,  a  substitution  of  the  Fourier  modulus  by  the 
known  values,  and  an  Inverse  Fourier  transformation.  Together,  these 
operations  constitute  the  enforcement  of  Fourier  knowledge  and  may  be 
viewed  as  a  single  nonlinear  operation.  This  Is  depicted  schematically 
In  Figure  6-4,  It  Is  Important  to  recognize  that  any  output  of  this 
operation  will  satisfy  the  Fourler-domaln  constraints  and  consequently 
ep2  will  be  zero.  Should  the  output  also  satisfy  the  object-domain 
constraints  then  a  solution  has  been  found.  This  suggests  that  clever 
adjustments  to  the  Input  function  might  produce  an  output  that  more 
closely  satisfies  the  object-domain  constraints.  The  degree  of 
consistency  with  the  support  constraint  can  be  monitored  by  the  e02 
error  metric  defined  In  Eq.  (6-6).  A  variety  of  feedback  strategies 
borrowed  from  nonlinear-systems  control  theory  can  be  employed  to  modify 
the  latest  Input  In  order  to  drive  the  e02  error  metric  toward  zero. 
The  use  of  each  feedback  rule  defines  an  Individual  algorithm  and  the 
collection  of  feedback  rules  defines  the  class  of  Input-output  phase- 
retrieval  algorithms.  All  feedback  rules  that  have  been  employed  to 
date  are  point  operations  meaning  that  an  Input  pixel-value  adjustment 
Is  based  solely  upon  the  desired  change  In  the  corresponding  output 
pixel  value. 

We  recognize  Immediately  that  If  a  solution  were  to  serve  as  an 
Input  function  then  It  will  pass  through  the  nonlinear  modulus  operator 
unchanged.  Notice  however  that  other  Inputs  can  also  output  a  solution. 
In  fact  any  Input  function  with  the  proper  Fourier  phase  will  produce  a 
solution.  Thus  a  solution  will  result  from  any  of  an  uncountable 
Infinity  of  Input  functions,  many  of  which  differ  dramatically  from  the 
solution,  The  ER  algorithm  may  be  viewed  as  a  particular  Input-output 
algorithm  for  which  the  feedback  rule  drives  the  Input  (as  well  as  the 
output)  toward  a  solution.  By  contrast  most  Input-output  algorithms 
have  a  more  flexible  feedback  rule  since  they  may  converge  upon  any  of 
the  many  input  functions  that  yield  a  true  solution  upon  output. 


We  reiterate  that  any  output  for  which  e^  Is  zero  will  be  a 
solution.  Therefore,  the  task  of  simultaneously  minimizing  the  Fourier 
and  object-domain  error  metrics  has  been  converted  Into  minimizing  a 
single  error  metric.  Unlike  the  summed  objective  function,  however, 
this  blending  of  the  two  error  metrics  Into  one  Is  accomplished  without 
resorting  to  ad  hoc  methods  such  as  summing. 


We  use  the  term  objective  function  to  refer  to  an  error  metric  in 
conjunction  with  a  set  of  underlying  parameters.  A  logical  candidate 
for  an  alternative  objective  function  suggested  by  Input-output 
algorithms  Is  the  e0fc  error  metric  as  a  function  of  Input  pixel  values. 
This  new  objective  function  should  not  be  confused  with  the  e  ^ (g * (x) ) 
objective  function  used  In  the  ER  algorithm  which  treats  the  N2  object- 
estimate  (output)  pixel  values  as  parameters.  By  contrast  the  new 
objective  function,  eQ2(g(x)),  utilizes  the  Input-function  pixel  values 
as  parameters  associated  with  the  object  estimate  given  upon  output. 
Having  made  this  subtle  but  critical  distinction  we  may  now  write  an 
expression  for  the  gradient  of  the  eQ2(g(x))  objective  function: 


N2  Be2 
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(6-15) 


As  before  a  numerical  computation  of  the  gradient  Is  overwhelming.  It 
Is  natural  to  ask  If  an  analytic  expression  for  the  gradient  can  be 
derived.  While  the  details  of  this  calculation  are  outlined  In  Appendix 
F,  we  give  the  surprisingly  simple  result  here: 


IF(u)!Ge(u)  G‘(u)G*  (u) 

TgTuTT  g^TuT 


e12ru*Xj/N 


(6-16) 


where  *  denotes  complex  conjugate  and  Ge(u)  Is  the  Fourier  transform  of 
an  error  Image  ge(x),  where 

9e(x)  -  S' (x)g' (x)  (6-17) 

and 


S'  (x) 


1  ,  x  e  S' 
0  ,  x  e  S 


(6-18) 


Three  FFT  operations  are  required  to  compute  Ge  from  g*  A  very 

Important  feature  of  the  analytic  partial  derivative  quoted  In  Eq. 

(6-16)  Is  that  It  has  the  form  of  a  DFT.  The  Implication  Is  that  given 
the  expression  within  the  brackets  all  partial  derivatives  needed  to 
compute  the  gradient  are  provided  by  a  single  1  DFT.  Thus  the  total 
computational  cost  of  finding  Ve02(g(x))  for  a  given  Input  function  Is 
four  FFTs  plus  minor  overhead.  With  these  manageable  computational 
requirements  the  e0z(3(x))  objective  function  may  be  minimized  via 

various  gradient-search  algorithms.  Some  care  must  be  taken  In  the 
evaluation  of  Eq.  (6-16)  tc  avoid  division  to  zero.  This  problem  can  be 
circumvented  by  adding  a  small  constant  to  the  Fourier  magnitude  of  the 
Input  function  at  those  spatial  frequencies  for  which  IG(u)l  Is 

Identically  zero. 


Notice  that,  like  Input-output  algorithms,  there  are  many  Input 
functions  to  which  a  gradient- search  algorithm  can  converge  for  this 
objective  function.  This  means  that  the  objective  function  contains 
many  global  minima,  each  equally  acceptable  for  producing  a  solution  as 
an  output.  It  is  conceivable  that  this  multiplicity  of  Input  solutions 
could  yield  faster  convergence  rates  than  an  objective  function  having 
only  a  single  global  minimum  (e.g.  the  summed  objective  function). 
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It  is  useful  to  recognize  that  any  gradient-search  algorithm  used  In 
conjunction  with  the  eQ2(g(x))  objective  function  may  also  be 
Interpreted  as  a  particular  feedback  rule  In  an  Input-output  algorithm. 
Unlike  other  feedback  rules,  however,  this  rule  Is  not  a  point 
operation.  In  other  words,  the  gradient-search  feedback  rule  Is  more 
flexible  than  other  existing  rules  since  many  Input  pixels  may  be 
adjusted  In  order  to  effect  a  desired  change  In  a  single  output  pixel. 

Unfortunately,  there  is  no  guarantee  a  priori  that  the  eQ2(g(x)) 


objective  function  has 


surface  contour  that  lends  Itself  to 


minimization  via  gradient  search.  For  example  the  eQ2(g(x) )  surface  may 
contain  many  local  minima  in  which  gradient-search  algorithms  could 
become  entrapped.  Answers  to  such  questions  are  often  the  by-product  of 
extensive  experimentation. 
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Some  preliminary  experiments  were  performed'  In  which  the  eQ<i(g(x)) 
objective  function  was  used  In  conjunction  with  the  method  of  steepest 
descent.  A  number  of  observations  can  be  made  about  the  results 
displayed  in  Figure  6-5.  Notice  the  dominant  stripes  In  the  gradient 
Imago  for  the  first  Iteration.  By  gradient  Image  we  mean  the  Image  for 
which  each  pixel  value  Is  assigned  the  value  of  the  associated  component 
of  the  gradient.  This  Is  the  image  that  Is  scaled  and  added  to  the 
latest  Input  Image  to  acquire  the  succeeding  Input  Image  In  a  steepest- 
descent  scheme.  These  stripes  are  Intriguing;  but  their  origin  Is 
unkonwn  at  present.  The  magnitude  of  the  gradient  was  observed  to 
decrease  with  Iteration  number.  As  a  result,  the  stripes  from  the  first 
gradient  Image  still  persist  in  the  100th  Input  Image.  Notice,  however, 
that  the  stripes  do  not  appear  In  on  output  Image,  which  Is  consistent 
with  the  notion  that  the  Input  Image  need  not  resemble  the  output  Image. 
It  Is  encouraging  that  after  ICO  Iterations  the  output  Image  bears  a 
rough  resemblance  to  the  true  object.  More  experimentation  with  this 
objective  function  is  needed  before  a  judgment  can  be  made  about  Its 
usefulness.  For  example,  more  sophisticated  gradient-search  methods 
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FIGURE  6-5.  PRELIMINARY  IMAGES  DERIVED  FROM  MINIMIZING  THE  e 
OBJECTIVE  FUNCTION 


would  have  a  better  chance  of  converging  to  a  solution.  Should  the 
e02 (g (x) )  objective  function  In  conjunction  with  the  best  gradient- 
search  methods  prove  not  to  be  competitive  with  current  Input-output 
algorithms,  It  may  yet  be  useful  for  breaking  out  of  stagnation 
epi sodes. 

We  conclude  this  section  by  noting  that  while  we  have  restricted 
objects  to  be  real -valued  for  simplicity,  the  case  admitting  complex 
objects  Is  of  great  Interest  when  objects  are  illuminated  coherently. 
The  definition  and  derivation  of  the  gradient  of  the  e02(g(x))  objective 
function  for  complex  objects  Is  presented  In  Appendix  G. 

6.5  FOURIER  PHASE  PARAMETERS 

The  choice  of  underlying  parameters  for  an  objective  function  can 
have  a  tremendous  Impact  upon  the  behavior  of  gradient-search 
algorithms.  To  this  point  we  have  selected  the  Input  pixel  values  (or 
real  and  Imaginary  parts  of  the  Input  pixels)  as  our  N2  (or  2N2) 
parameters  underlying  the  e02(g(x))  objective  function.  This  choice  has 
merit  since  It  affords  an  analytic  expression  for  the  gradient  requiring 
only  four  FFTs.  An  alternative  and  very  different  set  of  parameters 
worth  consideration  Is  the  set  of  Fourier  phase  values  In  a  Fourier 
estimate  of  a  solution.  Because  the  Fourier  modulus  Is  known,  a  Fourier 
estimate  Is  determined  by  an  estimate  of  the  Fourier  phase,  ^(u): 

G'(u)  =  IF(u)leWu)  .  (6-19) 
An  inverse  FFT  gives  the  corresponding  object-domain  estimate, 


This  estimate  may  also  be  Interpreted  as  the  output  from  an  Input-output 
algorithm  since  It  has  the  proper  Fourier  modulus.  Consequently,  the 
object-domain  error  metric  can  be  computed: 

•f  ■  I-  18'WI2  •  (6-21) 

0  xeS' 

The  e02  error  metric  Is  therefore  Implicitly  a  function  of  the  Fourier 
phase  values  and  eo2(0(u))  serves  as  the  third  new  objective  function 
Introduced  In  this  chapter.  We  mention  parenthetically  that  throughout 
this  section  we  allow  for  complex-valued  objects  since  there  Is  no 
simplification  of  derivations  by  resorting  to  real-valued  objects. 
Notice  that  the  designation  of  the  Fourier  phase  values  as  the 
underlying  parameters  has  fixed  the  number  of  parameters  at  N2.  This  H 
exactly  half  the  number  of  parameters  that  occur  when  using  the  real  and 
•Imaginary  parts  of  the  Input;  pixel  values  as  parameters.  It  remains  to 
be  seen,  though,  If  an  analytic  expression  for  the  gradient  of  the 
object-domain  error  metric  with  respect  to  the  Fourier  phase  parameters 
can  be  derived. 

The  gradient  Is  defined  as 

2  J?  0e* 

Vl*<u»  -gSTIujrj  •  (6—22) 

where  vj  is  the  unit  vector  in  parameter  space  associated  with  the  phase 
parameter  at  location  uj  In  the  Fourler-domaln  estimate.  As  usual,  the 
heart  of  the  gradient  Is  the  partial  derivative 


+  c.c. 


(6-24) 


where  C.C.  stands  for  complex  conjugate.  The  partial  derivative  In  Eq, 
(6-24)  may  be  simplified: 

■  IT*  E  IF(«)le"12,U,X/N  4ny  .■**«“)  (6-251 

J  U  J 

■  N-2  E  IF(u)le",2,U,X/NH)«'^(“)«„  u  (6-26) 

u  u'uj 

Applying  the  sifting  property  of  the  Kronecktr  dolta,  1n  Eq. 

(6-26)  leaves  only  one  term  from  the  summation: 

jja^Sl  .  N-2  IF(uJ)le-12,ru.)*x/N(-1)e-1f>'uj>  •  (6-27) 

J 

Substituting  back  Into  Eq.  (6-24): 

IJ(~T  «£  [g'(x)H)N‘2IF(uj)le'i2tuj*x/V1^lJj)  +  C.C.]  (6-28) 

«  N~2  IF(Uj)  I  [  [(-1)e-1^uJ^  ]T  S'(x)g'(x)e”12TUj*x/N]  +  C.C. 

x  (6-29) 

The  summation  In  Eq.  (6-29)  Is  the  Fourier  error  Image,  Ge(u),  defined 
In  the  previous  section  by  Eq.  (6-17).  Therefore  we  have 


-  N~2  IF(Uj)l  [(-1)Ge(uJ)e“^(uj)  +  C.C.]  (6-30) 
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(6-31) 


where  Im  {•}  stands  for  the  Imaginary  part. 


Again  we  have  been  able  to  find  an  expression  for  the  gradient  with 
a  remarkably  compact  form.  Equation  (6-31)  Implies  that  the  component 
of  the  gradient  associated  with  the  spatial  frequency  uj  Is  proportional 
to  the  modulus  at  that  spatial  frequency  and  Is  dependent  upon  the 
Fourler-domaln  error  Image  and  the  latest  Fourier  phase  In  a  less  direct 
way.  An  examination  of  Eq.  (6-31)  reveals  that  the  entire  gradient  can 
be  computed  with  2  FFTs  plus  minor  overhead.  The  actual  evaluation  of 
the  objective  function  for  a  particular  Fourler-phase  estimate  requires 
only  one  FFT,  Thus  employing  Fourler-phase  values  as  optimization 
parameters  Is  certainly  competitive  with  the  use  of  Input-pixel  values 
from  the  standpoint  of  operations  required  to  compute  the  gradient.  How 
these  two  gradient-search  formulations  compare  with  respect  to 
convergence  properties  can  only  be  determined  by  experimentation.  We 
might  expect  the  Fourler-phase  formulation  to  perform  differently  since 
the  Fourler-phase  parameters  are  so  different  In  character  from  and 
nonll nearly  related  to  the  Input  pixel -value  parameters.  Use  of  the 
Fourier  phase  for  parameters  has  the  added  appeal  that  these  are  In  fact 
the  unknowns  In  the  phase-retrieval  problem.  As  a  result  the  Fourier 
phase  formulation  Is  somewhat  more  direct  and  may  lend  itself  to 
analysis  when  noise  Is  present, 

6.6  PRELIMINARY  RESULTS 

A  series  of  experiments  was  performed  to  test  the  use  of  these 
proposed  objective  functions.  The  first  experiments  performed  tested 
the  summed  objective  function  using  a  conjugate-gradient  mlnlmlzer.  The 
conjugate-gradient  algorithm  Is  known  to  have  a  convergence  rate  far 
superior  to  that  of  steepest  decent  In  the  vast  majority  of  applications 
[6-10],  This  algorithm  minimized  well  at  the  outset  but  after  several 


Iterations  It  stagnated  at  an  eroneous  estimate.  It  Is  of  considerable 
Interest  that  the  stagnation  occurred  at  the  same  estimate  for  which  the 
error  reduction  algorithm  stagnated.  Multiple  experiments  should  be 
performed  In  an  attempt  to  duplicate  this  behavior.  Nevertheless  this 
Isolated  result  suggests  that  error  reduction  may  In  fact  stagnate  due 
to  true  local  minima  In  the  objective  function  as  opposed  to  extremely 
slow  convergence  rates.  Even  If  this  conjecture  were  true,  the  summed 
objective  function  might  profitably  be  used  to  replace  error  reduction 
In  Iterative  sequences  that  also  employ  the  hybrid  Input-output 
algorithm. 

A  second  .  set  of  experiments  were  performed  with  the  eQ  (^(u) ) 
objective,  function ,,jn  conjunction  with  the  conjugate-gradient  mlnlmlzer. 
This  combination  managed  to  find  the  true  solution  for  a  very  small 
object.  The  object  support  was  an  Isosceles  right  triangle  with  seven 
pUels  on  a  side.  The  object  was  embedded  In  a  16  x  16  pixel  array. 
Unfortunately,  when  the  object  was  doubled  In  size  so  that  the  base 
frame  size  was  32  x  32  pixels,  the  algorithm  stagnated  without  finding 
the  true  solution.  It  Is  unclear  at  present  why  the  algorithm  worked 
for  the  small  Image  and  not  the  large  Image.  Perhaps  the  additional 
parameters  Increase  the  probability  of  encountering  local  minima. 

Preliminary  results  associated  with  the  eQ2 (g (x) )  objective  function 
are  reported  In  Section  6.3. 

6.7  CONCLUSIONS  AND  FUTURE  WORK 

We  have  shown  that  the  error  reduction  (ER)  algorithm  may  be  , 
Interpreted  as  a  constrained  steepest-descerit  algorithm  for  which  the 
objective  function  consists  of  the  Fourler-domain  error  metric  as  a 
function  of  pixel  values  In  the  latest  estimate.  In  addition  we  have 
proposed  three  new  objective  functions  for  performing  phase  retrieval 
using  gradient-search  methods.  These  include  (1)  use  of  the  summation 


objective  function  with  pixel  values  of  the  latest  estimate  as 
optimization  parameters,  (2)  use  of  the  object-domain  error  metric  with 
Input  pixel  values  as  parameters,  and  (3)  use  of  the  object-domain  error 
metric  with  Fourler-phase  values  as  parameters.  Analytic  expressions 
for  the  gradients  for  each  of  these  approaches  have  been  derived.  The 
simplicity  of  these  expressions  Implies  that  gradient-search  methods 
have  the  hope  of  being  computationally  tractable  and  even  competitive 
with  existing  Input-output  algorithms.  The  total  number  of  FFTs 
required  to  evaluate  the  objective  function  and  comoute  the  gradient  for 
each  of  these  approaches  Is  shown  In  Table  6-1. 


Table  6-1 

NUMBEP  OF  FFTS  REQUIRED  FOR  GRADIENT-SEARCH  APPROACHES 


Objective  Function 

JFFTs  to  evaluate 
objective  function 

IFFTs  to  evaluate 
gradient 

ep(fl(x)) 

1 

2 

e*(g(x)) 

1 

2 

e*(g(x)) 

2 

5 

•JWu» 

1 

2 

Of  course  extensive  experimentation  needs  to  occur  to  see  If  the 
surface  contour  of  each  proposed  objective  function  Is  well  suited  for 
gradient-search  methods.  Surface  contour  depends  upon  such  things  as 
the  Intrinsic  definition  of  the  objective  function,  the  particular  true 
object,  and  the  amount  of  noise  In  the  date.  The  suitability  of  a 
particular  gradient-search  algorithm  to  a  given  surface  contour 
manifests  Itself  In  the  convergence  rates  of  the  algorithm.  For  example 
a  memoryless  modified  Newton  method  [C.10]  may  converge  well  with  the 
same  objective  function  for  which  a  steepest-descent  algorithm 
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stagnates.  In  addition,  these  gradient-search  approaches  need  to  be 
tested  In  a  role  that  complements  current  Input-output  algorithms. 
Gradient-search  approaches  could  make  a  significant  contribution  to  the 
field  of  phase  retrieval,  should  they  consistently  provide  a  mode  of 
escape  from  any  of  the  various  types  of  stagnation  that  have  been  known 
to  appear  with  Input-output  algorithms. 
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SECTION  7 

MODELING  APPROACH  TO  PHASE  RETRIEVAL 


The  modeling  approach  Is  a  new  method  for  attempting  to  solve  the 
phase  retrieval  problem.  In  this  section  we  describe  the  modeling 
approach  In  general  terms,  and  then  discuss  a  particular  Implementation 
that  was  attempted. 

Let  F(u,v)  ■  IF(u,v)l  exp[1jf(u,v)]  be  the  complex  Fourier  transform 
of  a  particular  object.  Suppose  that  either  F(u,v)  over  the  entire 
measurement  aperture  or  F(u,v)  over  some  small  area  can  be  modeled  by  a 
parameterized  function,  Ms 

M(u,v,;a,b,,..)  -  IM(u,V}a,b...) I  exp[1^(u,v;a,b, ...)],  (7-1) 

where  a,b,...  are  unknown  parameters.  If  we  are  given  only  the  Fourier 
modulus,  IF(u,v)l,  then  It  might  be  possible  to  estimate  the  phase, 
1*(u,v),  by  (1)  finding  the  values  of  the  parameters  a,b,...  that  best 
fit  the  modulus  of  the  model,  IM(u,v;a,b,...)l,  to  I F(u, v) I ,  and  (2) 
evaluating  ^(u,v;a,b, . ..)  for  that  set  of  values  of  the  parameters. 

The  most  difficult  part  of  this  approach  Is  finding  a  model,  M,  that 
Is  suitable. 

In  a  first  attempt  at  using  the  modeling  approach,  each  small  area 
about  the  local  maxima  of  the  Fourier  modulus  was  modeled  using  a 
function  taken  from  the  control  theory  literature.  Suppose  that 
contours  about  a  local  maximum  of  the  Fourier  modulus,  at  a  level  3  dB 
down  from  the  local  maximum,  have  an  elliptical  shape,  with  the  major 
axis  of  length  wb  at  an  angle  0b  relative  to  the  u-axls  and  minor  axis 
w^.  Let  the  local  maximum  be  at  location  (ub,vb)  where  It  has  the  value 

Ab  -  !F(ub,vb)l.  (7-2) 
Also  define  the  distance  from  a  given  point  (u,v)  to  the  peak  (ub,vb)  as 
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and  let 


and 


w  -  [{u-uK)2  +  (v-vh)2]1/2, 


6  -  tan_1C(v-vb)/(u-ub)] , 


wc  ■  Wb  cos (flb-0)  +  Wd  si  n{9b-e). 

Then  the  model  we  used  for  a  region  about  the  local  maximum  Is 
M(w;Ab,0b,wb,wd,D) 

which  has  squared  modulus 


Vc 

w2  -  w2  +  12WW„D 
c  c 


I M  r 


Abwc 


(w2  -  w2)2  +  (2wwcD)2 


and  phase 


l  ■  -tan~1[2wwcD/(w2  -  w2)]. 


Note  that  the  parameters  W&,  w^  and  are  contained  within  wc, 
These  expressions  were  used  In  the  following  way: 

(1)  A  local  maximum  of  the  squared  Fourier  modulus  was  found. 


(2)  A  curve  fit  of  Eq. (7-7)  to  the  squared  Fourier  modulus  was 
to  estimate  the  unknown  parameters. 


(7-3) 

(7-4) 

(7-5) 

(7-6) 

(7-7) 

(7-8) 

performed 
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(3)  The  phase  In  that  region  was  computed  by  Eq.(7-8)  using  the 
parameter  estimates. 

(4)  Eq . (7-7)  was  evaluated  using  the  parameter  estimates  and  subtracted 
from  the  squared  Fourier  modulus,  leaving  the  residual  Fourier 
modulus. 

(5)  Repeat  steps  (1)  to  (4)  replacing  the  squared  Fourier  modulus  with 
the  residual  Fourier  modulus,  until  a.ll  the  major  local  maxima  are 
accounted  for. 

(6)  Form  the  net  Fourier  phase  as  the  sum  of  all  the  phase  functions 
obtained  In  step  (3). 

(7)  Form  an  Image  by  Inverse  Fourier  transforming  the  complex  function 
formed  from  the  given  Fourier  modulus  and  the  net  Fourier  phase. 

Note  that  for  large  w,  I  Ml 2  approaches  zero  and  ^  In  Eq . (7-8) 
approaches  zero,  so  the  model  has  strong  local  effect  near  each  local 
maximum  and  a  weaker  effect  on  neighboring  points. 

When  the  procedure  was  performed  for  a  SAR  Image  of  the  type  used  In 
the  digital  experiments  described  In  Section  5,  the  reconstructed  Image 
bore  no  resemblance  to  the  original  object.  The  reason  for  failure  Is 
not  totally  understood,  but  we  speculate  that  the  model,  Eq . (7-6) ,  Is 
not  appropriate  to  the  Fourier  transforms  of  SAR  Images. 

If  further  work  along  these  lines  were  to  be  pursued,  It  would  be 
Important  to  first  develop  more  appropriate  models  for  SAR  signal 
histories. 


SECTION  8 

LABORATORY  EXPERIMENTS 

Under  Task  3  of  the  program,  laboratory  experiments  were  performed 
to  demonstrate  ;  duced  tolerance  Imaging.  These  experiments  tested  the 
theoretical  developments  concerning  constraints,  measurements,  phase 
retrieval  and  Image  reconstruction  algorithms,  and  uniqueness  and 
sensitivity  issues  under  more  realistic  conditions  than  was  possible  In 
the  computer  simulations  performed  under  Tasks  1  and  2.  The  use  of  real 
objects,  Illumination  sources,  optics,  and  detectors  placed  greater 
demands  on  the  reconstruction  algorithm.  The  quality  of  the 
reconstructed  Image  from  experimental  data  was  compared  to  a  "ground 
truth"  Image  collected  In  the  laboratory  with  a  conventional  sensor 
having  an  equivalent  aperture. 

Two  experiments  simulating  different  types  of  systems  were 
undertaken:  an  active  coherent  experiment  In  the  visible  and  a  passive 
Incoherent  experiment  In  the  visible.  In  the  active  coherent 
experiment,  a  target  was  illuminated  with  a  laser  and  Intensity  data 
collected  In  the  far-fleld  of  the  target.  A  reconstruction  algorithm 
was  then  used  to  determine  the  phase  In  the  far-fleld  of  the  target  and 
therefore  an  Image  of  the  target.  Thus,  this  experiment  simulated  an 
active,  coherent  reduced-tolerance  sensor  which  measures  Intensities 
only  and  would  be  Insensitive  to  aberrations  of  the  primary  optics.  In 
the  passive  Incoherent  experiment,  a  multiple  mirror  telescope  was 
simulated  and  degraded/defocused  Image  Intensities  were  measured. 
Further  discussion  of  both  experiments  Is  given  In  Sections  8.1  and  8.2. 

8.1  ACTIVE  EXPERIMENT 

The  active  experiment  demonstrated  Imaging  of  a  coherently 
Illuminated  target  from  Intensity-only  measurements  made  In  the  far- 
fleld,  This  simulated  a  sensor  having  greatly  reduced  tolerance  to  the 
position  and  quality  of  Its  receiving  aperture  compared  to  a 


conventional  Imaging  sensor.  The  wide  range  of  parameters  which  were 
considered  In  planning  this  experiment  and  which  were  available  to  be 
varied  to  test  theoretical  developments  and  computer  simulation  results 
are  discussed  In  Section  8.1.1.  The  experiment  design  •.  d  data 
collection  are  explained  In  Section  8.1.2.  Data  processing  methods  and 
experimental  results  are  given  In  Section  8.1.3. 

8.1.1.  ACTIVE  EXPERIMENT  PARAMETERS 

A  permanent  phase  retrieval  laboratory  was  created.  In  this 
laboratory,  a  wide  range  of  parameters  Is  available  to  be  varied  to  test 
theoretical  developments  and  computer  simulation  results.  These 
parameters  had  to  be  carefully  controlled  to  ensure  meaningful  results 
In  the  active  experiment.  The  most  Important  of  these  parameters  are 
discussed  In  this  section. 

The  pattern  of  Illumination  on  the  target  can  be  described  by  Its 
spatial  and  temporal  coherence,  shape,  sharpness  of  edges,  phase 
distribution,  angle  of  Incidence  on  the  target,  and  polarization.  All 
of  these  parameters  may  affect  the  quality  of  the  reconstructed  Image. 
Equally  Important,  they  may  take  on  different  values  depending  on  the 
application  bolng  simulated  In  the  laboratory.  In  an  application  where 
the  target  Is  actively  Illuminated  by  a  laser,  the  spatial  (transverse) 
and  temporal  (longitudinal)  coherence  lengths  may  be  less  than  the 
target  size.  The  laboratory  system  can  allow  Illumination  with  variable 
coherence  lengths  either  by  manipulating  the  spatial  coherence  of  a  gas 
laser  or  by  using  a  broader-band  dye  laser.  It  Is  known  from  ERIM 
Investigations  that  the  Illumination  pattern  shape  and  sharpness  of  the 
edges  affects  reconstruction  algorithm  convergence.  In  applications, 
the  range  of  Illumination  shapes  and  sharpness  of  edges  may  be  limited 
by  practical  considerations  on  the  transmuting  aperture.  The 
i  laboratory  system  can  accept  a  variety  of  masks  and  image  them  onto  the 

!  target  through  a  controllable  finite  aperture  to  control  Illumination 
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shape  and  sharpness.  Since  the  Illumination  phase  may  not  be  constant 
over  the  target  In  practical  applications,  the  masks  can  be  holographic 
If  experimental  control  of  the  Illumination  pattern  phase  distribution 
Is  desired.  Applications  may  be  either  monostatic  or  bistatic,  so  the 
laboratory  system  allows  for  either.  The  target  will,  In  most  cases, 
partially  depolarize  the  Illumination.  The  experimental  system  Is 
capable  of  making  measurements  of  the  two  orthogonal  components  of  the 
light  at  the  detector. 

The  target  parameters  Include  reflectivity  contrast  and  structure, 
surface  roughness,  surface  topography  (3-D  nature  of  target),  motion 
during  the  measurement  process,  and  noncoherent  background  Illumination. 
Practical  targets  will  vary  In  their  roughness,  although  nearly  all  will 
be  rough  at  visible  arid  Infrared  wavelengths.  An  experimental  system 
should  primarily  use  rough  targets  to  create  real  speckle  effects. 
However,  It  was  useful  In  the  active  experiment  to  use  smooth  targets 
(film  transparencies  In  a  liquid  gate)  In  setting  up  the  experiment  to 
test  and  debug  the  optical  and  electronic  components  and  software.  Real 
targets  are  three-dimensional,  but  to  varying  degrees.  A  variety  of  3-D 
objects  is  available  to  the  experimenter.  Any  real  target  will  also  be 
noncoherently  Illuminated  from  various  thermal  sources.  While  this 
Illumination  will  only  add  a  uniform  bias  to  the  far-fleld  measurements, 
It  can  be  Included  In  the  experimental  setup. 

The  propagation  path  between  the  target  and  the  sensor  can  be  of 
such  a  length  as  to  be  either  near  or  far  field  and  can  Include 
atmospheric  turbulence,  scattering  (aerosols,  fog,  smoke),  and 
absorption.  In  an  application,  any  or  all  of  these  effects  may  be 
present.  The  optical  setup  allows  for  Insertion  or  removal  of  lenses  to 
give  either  near  or  far  field  conditions  at  the  detector.  (It  must  be 
noted  that  the  size  of  the  speckles  In  the  measurement  plane  will  depend 
on  the  sensor  distance.)  Turbulence  with  the  proper  statistical 
properties  Is  very  difficult  to  simulate  in  an  Indoor  laboratory.  The 
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best  approach  (and  one  w'  .  allows  reproducible  turbulence)  Is  to  use 
movable  phase  plates.  These  are  glass  plates  with  controlled  thickness 
variations.  Scattering  and  absorption  are  easier  to  simulate  (e.g., 
with  fog  chambers,  optical  narrow-band  filters).  Since  their  main 
effects  are  to  reduce  signal  levels  and  Increase  detector  bias  light 
levels,  they  can  be  studied  as  described  In  the  next  paragraph  on 
detector  parameters. 

The  most  Important  detector  parameters  are  signal  level,  type  of 
noise,  noise  level,  background  Illumination  level,  spatial  and  temporal 
sampling  rates,  polarization  detected,  non! Inearl  ties  In  response,  and 
spatial  nonuniformities  In  response,  bias  and  noise.  The  values  of 
these  parameters  are  crucial  to  the  viability  of  any  real  application. 
The  experimental  setup  Is  able  to  vary  the  signal  and  background 
Illumination  levels,  simulate  various  sampling  rates,  detect  orthogonal 
polarizations,  and  create  nonuniform  background  Illumination  levels. 
The  types  of  noise,  noise  level,  nonuniformities  In  detector  response 
and  noise  (pattern  noise),  and  nonl Inearl  ties  In  response  are  primarily 
determined  by  the  detector  chosen  for  the  experiment.  An  Important 
aspect  of  the  experiment  design  and  procedure  was  to  measure,  calibrate, 
and  correct  for  the  effects  of  these  parameters  on  the  measured  data. 
This  was  a  difficult  task  and  much  practical  experience  was  gained  which 
can  be  applied  to  other  detectors  In  the  future. 

Many  applications  Involving  active  Illumination  can  be  expected  to 
have  low  signal  levels.  To  adequately  simulate  these  applications,  an 
Image  Intenslfler  can  be  used  before  the  detector.  Even  with  a  thermal- 
noise-limited  detector,  the  use  of  an  Image  Intenslfler  may  allow 
operation  In  a  shot  (photon)  noise  limited  mode.  The  Image  Intenslfler 
will,  of  course,  also  have  nonuniformities,  nonllnearltles,  and  a 
spatial  sampling  rate  which  must  be  measured  and  considered  In  the 
experimental  setup  and  data  analysis. 
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The  effects  of  speckle  are  so  Important  to  an  active  coherent 
experiment  that  Its  parameters  deserve  to  be  discussed  separately.  The 
speckle  In  the  measurement  plane  will  have  Its  size  determined  chiefly 
by  the  size  of  the  Illuminated  region  on  the  target  and  by  the  optics 
(If  any)  placed  between  the  target  and  the  sensor.  Ideally,  the 
detector  must  sample  the  Intensity  speckle  pattern  at  the  Nyqulst  rate 
(two  samples  per  speckle)  or  greater.  For  a  given  detector,  magnifying 
optics  may  be  necessary.  It  may  also  be  desired  to  Investigate  the 
effect  of  measurements  at  less  than  the  Nyqulst  rate. 

The  data  processing  hardware  and  software  available  In  the  phase 
retrieval  laboratory  Is  discussed  In  Appendix  H. 

8.1.2.  ACTIVE  EXPERIMENT  DESIGN  AND  DATA  COLLECTION 

The  optical  system  used  to  obtain  the  results  shown  In  Section  8.1.3 
Is  shown  In  Fig.  8-1.  An  argon-ion  laser  beam  of  wavelength  X  »  0.5145 
/urn  Is  spatially  filtered,  collimated,  and  used  to  Illuminate  a 
transmissive  object  (some  experiments  with  reflective  objects  were  also 
performed).  The  object  consists  of  a  binary  mask  placed  In  contact  with 
a  ground  glass  [see  Fig.  8-2];  thus  the  transmittance  of  the  object  Is 
binary  In  Intensity  and  random  In  phase.  An  a  priori  Image  support 
constraint  Is  Introduced  by  giving  the  transmissive  region  of  the  mask  a 
known  overall  triangular  shape.  The  lens  Li  of  focal  length  fi  produces 
the  Fourier  transform  of  the  complex-valued  object  transmittance  In  Its 
back  focal  plane.  There,  an  aperture  A  selects  a  portion  of  the  Fourier 
transform  and  lenses  Lg  and  L3  (of  focal  lengths  f2  and  f 3 , 
respectively)  Image  this  portion,  with  suitable  magnification,  to  the 
detector  for  collection  of  Fourier  Intensity  data.  When  the  removable 
mirrors  Mi  and  Mg  are  in  place,  the  light  is  diverted  through  lens  L4 
which  produces  an  Image  of  the  object  at  the  detector.  Because  of  the 
placement  of  aperture  A,  this  conventional  Image  provides  a  reference 
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FIGURE  8-1*  EXPERIMENTAL  OPTICAL  SYSTEM  FOR  ACTIVE  EXPERIMENT 
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Detector 


for  comparison  to  the  Image  reconstructed  by  phase  retrieval.  A 
polarizer  was  placed  just  before  the  detector  to  ensure  detection  of 
only  a  single  polarization. 

The  base  and  height,  d,  of  the  triangular  mask  and  the  lens  focal 
lengths  must  be  chosen  so  that  the  speckle  In  the  Fourier  Intensity  at 
the  detector  Is  adequately  sampled.  Assuming  a  speckle  size  at  the 
detector  equal  to  Xfif3/f2d,  then,  to  sample  the  Intensity  at  the 
Nyqulst  rate,  the  detector  pixel  spacing,  As,  must  equal  Xfif3/2f2d.  In 
the  experiment,  d  was  about  16  mm  and  f i ,  f2»  and  fj  were  500  mm,  50  mm, 
and  300  mm,  respectively,  giving  As  a  48  /am.  The  CCD  detector  used  has 
horizontal  and  vertical  pixel  spaclngs  of  30  /im  and  18/im,  respectively, 
so  the  data  were  sampled  at  greater  than  the  Nyqulst  rate. 

Since  the  Fourier  Intensity  has  twice  the  spatial  frequency 
bandwidth  of  the  complex-valued  Fourier  transform,  measurement  of  2Nj  by 
2N2  Nyqul st-spaced  samples  of  the  Fourier  Intensity  enables 
reconstruction  of  a  complex-valued  Image  of  Nj  by  N2  resolution 
elements.  The  number  of  horizontal  and  vertical  pixels  in  the  detector 
array  thereby  sets  an  upper  limit  on  the  size,  In  resolution  elements, 
of  the  reconstructed  Image.  In  this  experiment,  the  central  180  by  256 
pixels  (over  a  5.4  by  4.6  mm  region)  of  the  detector  were  used.  The 
aperture  A  was  therefore  0.9  by  0.77  mm  and  the  reconstructed  and 
conventional  Images  had  56  by  48  resolution  elements  of  size  0.28  by 
0.34  mm  (at  the  object). 

The  diameter,  D,  of  lens  Li  was  50  mm  which  exceeds  the  diameter 
JT(d  +  2fi  tan  $)  a  25  mm  (where  sin  6  ■  XN/2d  and  N  is  either  Nj  or  N2) 
needed  to  avoid  vignetting.  The  other  lenses  need  also  be  only  f/20. 
The  focal  length,  f4,  of  lens  L4  was  1000  mm,  which  resulted  in  a 
deniagnlflcatlon  of  the  conventional  Image  by  a  factor  of  three  while 
still  allowing  more  than  adequate  sampling  by  the  detector. 
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The  data  was  detected  by  a  Fairchild  CCD  3000F  television  camera 
with  a  fiber  optic  faceplate.  The  RS17G  video  signal  was  converted  to  a 
512  by  512,  8-blt  digital  Image  using  an  Imaging  Technology  IP-512  video 
processor.  For  both  Fourier  Intensity  and  Image  data,  a  single  video 
frame  of  data  was  digitized  and  a  second  dark  frame  was  digitized  and 
subtracted  to  remove  pattern  noise.  The  automatic  gain  control  of  the 
camera  was  disabled  and  the  laser  output  was  adjusted  so  that  the 
brightest  speckle  nearly  saturated  the  detector.  Since  the  digitizer 
sampling  rate  was  not  matched  to  the  detector  horizontal  pixel  spacing, 
a  Matthey  MLW  401B  low-pass  video  filter  with  a  3-dB  width  of  4.3  MHz 
was  used  to  reduce  the  effect  of  CCD  clock  noise  on  the  digitized  data. 

The  support  constraint  was  measured  by  Increasing  the  size  of  the 
aperture  A,  digitizing  the  resulting  high  resolution  Image  of  the 
object,  and  measuring  the  base  and  height  of  the  triangle  In  the  digital 
Image.  Since  the  focal  lengths  of  the  lenses  and  the  digitizer  pixel 
horizontal  spacing  are  not  known  exactly,  calibration  measurements  must 
also  be  made  to  determine  the  spatial  scaling  of  both  the  Fourier 
Intensity  and  Image  data.  This  was  done  using  an  object  consisting  of 
two  circular  apertures  separated  by  about  four  times  their  diameter.  By 
orienting  this  object  both  horizontally  and  vertically,  gathering  both 
Fourier  Intensity  data  and  high  resolution  linage  data,  and  using  the 
known  digitizer  pixel  vertical  spacing  (which  Is  equal  to  the  detector 
pixel  spacing  of  18  /am) ,  the  spatial  scaling  of  the  data  may  be 
computed.  For  reference,  the  digitizer  pixel  horizontal  spacing  was 
determined  to  be  21,2  *  0.1  /am,  yielding  256  samples  over  a  5.4-mm  width 
of  the  detector. 

To  estimate  the  slgnal-to-nolse  ratio  of  the  data,  the  detector  was 
uniformly  Illuminated  with  an  extended  noncoherent  source,  10  frames 
were  digitized,  and  the  standard  deviation  of  the  digitized  values  was 
computed  on  a  pixel  by  pixel  basis.  The  standard  deviations  of  the 
8-blt  data  ranged  from  0.8  to  1.3.  (Part  of  this  variation  could  be  due 
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to  variations  In  light  source  Intensity  during  the  collection  of  the  10 
frames.)  For  signals  which  nearly  saturate  the  detector,  the  slgnal-to- 
detector  noise  ratio  of  the  digital  data  Is  therefore  about  200  to  1. 
Data  for  correction  of  spatial  variations  In  detector  response  were 
collected  by  uniformly  Illuminating  the  detector  at  one-tenth  the 
saturation  light  level,  summing  10  digitized  frames,  and  subtracting  10 
dark  frames.  The  ratio  of  the  standard  deviation  to  the  spatial  mean  of 
the  response  was  about  3%. 

8.1.3  DATA  PROCESSING  AND  EXPERIMENTAL  RESULTS 

Data  processing  began  with  a  256  by  256  array  of  digitized  Fourier 
Intensity  data  shown  In  Fig.  8-3,  This  array  was  divided  by  the 
response  data  to  correct  for  spatial  variations  In  detector  response. 
The  speckle  contrast  was  then  measured  and  found  to  be  80%.  Since  the 
speckle  should  have  100%  contrast,  It  was  assumed  that  some  positive 
bias  must  be  present  In  the  data,  possibly  due  to  the  effect  of  the  low 
pass  video  filter.  Therefore,  a  constant  was  subtracted  from  the  data 
to  Increase  the  contrast  to  90%.  Data  values  which  become  negative  In 
this  process  were  set  to  zero.  The  Fourier  transform  of  this  data  Is 
the  autocorrelation  of  the  Image.  Since  the  image  support  Is  known,  the 
autocorrelation  support  Is  also  known.  Therefore,  to  reduce  high 
frequency  noise  in  the  Fourier  Intensity  data,  It  was  low-pass  filtered 
by  zeroing  out  those  parts  of  Its  Fourier  transform  which  lay  outside 
the  support  of  the  autocorrelation  of  the  known  triangular  Image 
support.  After  setting  any  negative  values  to  zero,  this  filtered 
Fourier  Intensity  data  was  then  square-rooted  to  give  Fourier  modulus 
data. 

The  Fourier  modulus  data  and  the  triangular  Image  support  constraint 
were  used  In  the  Iterative  Fourier  transform  phase  retrieval  algorithm 
[8. 1-8. 4].  Several  cycles  (each  of  30  to  40  Iterations  of  hybrid  Input- 
output  with  p  »  0.7  followed  by  10  to  20  error-reduction  Iterations) 
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were  performed  for  a  total  of  840  Iterations.  At  this  point  the 
algorithm  was  making  no  further  progress.  The  normalized  root-mean- 
squared  Image-domain  error,  E0  [8. 1-8. 4],  was  0.112;  that  is,  the 
reconstructed  Image  was  within  11%  of  agreeing  with  the  measured  Fourier 
modulus  data  and  the  support  constraint.  This  Is  an  Indication  of  the 
amount  of  noise  and  distortion  present  In  the  measured  data. 

The  squared-magnitude  of  the  complex-valued  reconstructed  Image  Is 
shown  In  Fig.  8-4 (a)  and  a  conventional  (ground  truth)  Image  having  the 
same  spatial  frequency  bandwidth  Is  shown  In  Fig.  8-4(b).  The 
reconstructed  Image  was  resampled  to  approximately  match  the  dimensions 
of  the  conventional  Image,  Many  of  the  bright  speckles  appear  In  the 
same  location  In  each  Image  Indicating  that  some  of  the  high  frequency 
Information,  as  well  as  the  low  frequency  Information,  has  been 
successfully  reconstructed.  Since  several  minutes  elapsed  betwet., 
collecting  the  Fourier  Intensity  data  and  collecting  the  conventional 
Image,  some  of  the  difference  between  the  two  Images  may  be  attributable 
to  speckle  boiling  during  the  time  Interval. 

8.2  PASSIVE  EXPERIMENT 

The  objective  of  the  passive  experiment  Is  to  demonstrate  Imaging 
(In  the  visible)  of  a  noncoherently  Illuminated  target  from  Intensity- 
only  measurements.  In  conventional  passive  Imaging  systems  where  the 
Image  Is  degraded  by  phase  aberrations  due  either  to  atmospheric 
turbulence  or  to  misalignment  of  segmented  optical  element  arrays,  it  Is 
known  that  Image  quality  can  be  Improved  by  using  Iterative 
reconstruction  algorithms  operating  on  two  2-D  Intensity  measurements 
[8. 5-8. 6] .  These  two  measurements  can  be  of  a  best  focus  Image  and  an 
Intentionally  slightly  defocused  one  (that  Is,  a  quadratic  phase  error 
Is  Intentionally  Introduced  --  phase  diversity).  The  use  of  this 
approach  allows  reduced  tolerance  to  atmospheric  phase  or  to  accurate 
positioning  and  alignment  of  segmented  optics. 


(a) 


(b) 


FIGURE  8-4.  IMAGE  RECONSTRUCTION  FROM  EXPERIMENTAL  DATA,  (a)  Intensity 
of  coherent  Image  reconstructed  by  phase  retrieval,  (b)  Intensity  of 
conventional  coherent  Image. 
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The  passive  experiment  simulated  an  Imaging  system  In  which  the 
image  Is  degraded  by  misalignment  of  segmented  optics  In  the  pupil 
plane.  Two  Imago  Intensity  measurements  were  made:  one  In  the  plane  of 
best  focus  of  the  degraded  Image  and  one  In  a  plane  where  the  Image  Is 
defocused.  The  separation  of  these  two  planes  (and  therefore  the  amount 
of  quadratic  phase  error  in  the  pupil  plane  giving  the  defocused  Image) 
was  measured.  An  Iterative  algorithm  (using  gradient-search  techniques) 
was  used  In  an  attempt  to  estimate  the  (phase)  misalignment  of  the 
segmented  optics. 

The  optical  setup  used  In  the  experiment  is  shown  In  Fig.  8-5.  The 
two  lens  Imaging  configuration  of  Lg  and  L3  allowed  access  to  the  pupil 
plane.  A  computer-generated  holographic  element  P  In  the  pupil  plane 
allowed  phase  misalignment  of  segmented  optics  to  be  simulated.  The  six 
segment  hologram  aperture  Is  shown  in  Fig.  8-6.  Phase  shifts  of  the 
hologram  carrier  frequency  simulated  piston  errors  In  the  segmented 
optics  alignment.  Since  the  hologram  Is  computer-generated,  the  piston 
errors  are  known.  Another  advantage  of  the  setup  Is  that  a  "ground 
truth"  Image  can  be  formed  using  the  undiffracted  light. 

In  order  to  make  the  laser  light  spatially  Incoherent,  the 
unexpanded  laser  beam  first  Illuminates  a  stationary  ground  glass,  Gi , 
and  then  a  rotating  ground  glass,  G2.  The  center  of  the  laser  beam  was 
approximately  60mm  from  the  axis  of  rotation  of  the  ground  glass.  The 
target  T  used  In  the  experiment  was  a  segment  of  a  standard  bar  target. 
The  lens  l\  of  100mm  focal  length  Images  the  bar  target  onto  an 
adjustable  rectangular  aperture  A  that  blocks  everything  except  elements 
4  (45.3  lp/mm)  and  5  (50.8  lp/mm)  of  group  5.  Note  that  the  100mm  focal 
length  lens  magnifies  the  target  by  roughly  2x.  The  two  lenses  L2  and 
L3  of  focal  lengths  50mm  and  300mm  Image  the  target  onto  the  sensor  with 
6x  magnification.  The  carrier  frequency  of  the  hologram  was  chosen  so 
that  the  undiffracted  and  diffracted  images  are  separated  at  the 
detector  (a  Fairchild  CCD3000  camera). 
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FIGURE  8-5.  EXPERIMENTAL  OPTICAL  SYSTEM  FOR 


PASSIVE  EXPERIMENT. 


A  high-resolution  Image  of  the  object  was  collected  by  removing  the 
hologram  and  Imaging  through  a  circular  aperture  of  approximately  2cm 
diameter.  The  hologram  was  then  Inserted  and  the  undiffracted  Image  (A 
In  Fig.  8-5)  was  collected.  Because  of  the  low  diffraction  efficiency 
of  the  hologram,  all  diffracted  Images  were  obtained  by  adding  32  frames 
of  data,  then  blocking  the  laser  beam  and  subtracting  32  dark  frames.  A 
best  focus  Image  (B)  was  digitized  and  Images  (C)  at  1,  2,  and  3  waves 
of  defocus  also  collected.  To  compute  the  defocus  distance  Az,  the 
following  formula  was  used, 


Az 


1 

1/f  -  2n \/p 


(8-1) 


where  f  Is  the  focal  length  of  the  second  Imaging  len3  (300  mm),  n  Is 
the  number  of  waves  defocus,  p  Is  the  radius  of  the  hologram  (1.22  mm) 
and  X  ■  0.5145  /an. 


The  final  data  set  collected  was  a  responslvlty  map  of  the  detector 
array.  A  distant  Incoherent  source  Illuminated  the  array  at  a  light 
level  close  to  the  peak  value  for  the  diffracted  Images.  The  top  and 
right  sides  of  the  array  were  masked  to  provide  reference  regions. 
Thirty-two  frames  were  added  and  32  dark  frames  were  subtracted  to  give 
the  responslvlty  map.  This  data  was  processed  using  an  Iterative 
gradient  search  algorithm  In  an  attempt  to  estimate  the  (known)  piston 
errors.  Although  several  variations  In  the  choice  of  defocus  data  and 
In  the  Initial  starting  point  for  the  search  have  been  tried,  to  date  no 
successful  (low  error)  estimate  has  resulted. 


8.3  CONCLUSIONS 


Phase  retrieval  from  Fourier  Intensity  data  from  a  coherently 
Illuminated,  diffuse  object  collected  In  a  laboratory  experiment  has 
been  demonstrated.  The  image-domain  constraint  for  the  Iterative 
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Fourier  transform  reconstruction  algorithm  was  a  known  triangular 
support.  The  reconstructed  Image  shows  good  agreement  with  a 
conventionally  obtained  Image.  With  this  promising  beginning,  further 
laboratory  experiments  are  warranted.  Image  reconstruction  accuracy 
should  be  Investigated  as  a  function  of,  for  example,  object  support 
shape,  sharpness  of  edges  of  support,  object  contrast,  object  surface 
roughness,  detected  data  slgnal-to-nolse  ratio,  data  sampling  rate, 
detector  calibration,  data  filtering,  and  Iterative  algorithm  versions 
used. 

Laboratory  data  was  collected  for  an  Incoherently  Illuminated  object 
Imaged  through  a  holographically-simulated  misaligned  segmented  optical 
system.  A  lack  of  sufficient  funds  prevented  us  from  refining  the 
experiment  to  the  point  where  useful  Imagery  could  be  reconstructed  from 
the  laboratory  data,  and  so  no  conclusions  can  be  drawn  from  that 
experiment.  Nevertheless  .  we  remain  optimistic  that  with  further 
refinements  the  technique  of  phase  diversity  can  allow  us  to  recover 
diffraction-limited  Images  from  badly  misaligned  segmented-aperture 
telescopes,  as  had  been  demonstrated  earlier  by  computer  simulations. 
We  recommend  that  further  experiments  be  carried  out  to  verify  the 
usefulness  of  this  concept.  If  successful,  this  concept  could 
dramatically  Increase  the  performance  or  decrease  the  cost  of  segmented 
optical  Imaging  systems. 
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SECTION  9 

APPLICATION  AND  ARCHITECTURE  STUDY 

9.1  INTRODUCTION 

In  this  section,  the  practicality  of  the  reduced  tolerance  Imaging 
concept  Is  Investigated  for  a  tactical  Imaging  application  using  laser 
Illumination.  The  optical  and  data  processing  architectures  are  defined 
for  the  Imaging  system  and  relationships  are  determined  between  system 
performance  measures  and  system  design  parameters.  For  a  specific 
wavelength  and  desired  performance,  numerical  values  of  the  system 
design  parameters  are  computed. 

9.2  TACTICAL  IMAGING  APPLICATION 

The  application  to  be  considered  Is  that  of  tactical  imaging  using 
laser  Illumination.  As  shown  In  Fig.  9-1,  both  air-to-ground  and 
ground-to-air  Imaging  can  be  treated  by  the  same  analysis  although  the 
primary  application  Is  air-to-ground  since  It  Is  required  that  the  area 
of  reflectivity  fill  the  Illumination  pattern.  A  pulsed  laser  would  be 
used  to  illuminate  the  target  region  and  reflected  laser  light  would  be 
collected  by  the  receiver.  The  key  parameters  specifying  system 
performance  are 

Resolution,  d 

Number  of  resolution  elements  In  Image,  N  by  N 

RMS  error  In  the  Image,  e  . 

The  use  of  the  reduced  tolerance  Imaging  concept  offers  (1)  the 
possibility  of  attaining  desired  values  of  these  parameters  at  lower 
cost,  lower  weight,  less  complexity  of  the  optical  system,  and  less 
disruption  of  aircraft  aerodynamics  and  (2)  the  possibility  of 
diffraction-limited  Imaging  through  atmospheric  or  aircraft  turbulence. 


9.3  SYSTEM  ARCHITECTURE 


To  use  the  reduced  tolerance  Imaging  concept,  the  Imaging  system 
will  have  optical  and  data  processing  architectures  that  are 
significantly  different  from  those  of  conventional  Imaging  systems.  The 
transmitting  and  receiving  optics  are  shown  In  Fig.  9-2.  A  pulsed  laser 
beam  Is  spatially  filtered,  expanded,  and  passed  through  a  mask.  The 
mask  Is  Imaged  to  the  target  region  to  provide  an  lllumlnatlon-pattern- 
determlned  support  constraint.  For  efficient  Illumination  of  the  mask 
(which  might  consist  of  a  transmitting  region  of  triangular  shape), 
holographic  beamforming  elements  should  be  considered.  To  produce  an 
Image  of  the  mask  which  has  low  sldelobes,  apodlzation  of  the  Imaging 
system  Is  necessary.  Low  sldelobes  are  required  to  minimize  the 
difficulties  encountered  In  reconstructing  Images  using  tapered  support 
constraints.  Although  a  lens  Is  used  to  Image  the  mask  In  Fig.  9-2, 
reflective  optics  could  also  be  used. 

Two  versions  of  the  receiver  optics  are  shown  In  Fig.  9-2.  Both 
make  far-fleld  intensity  measurements  of  the  laser  light  reflected  from 
the  target  region.  The  conformal  detector  consists  of  an  appropriate 
photon-to-electron  conversion  material  applied  conformally  to  the 
surface  of  a  vehicle  (for  example,  to  the  fuselage  of  an  aircraft).  If 
this  type  of  detector  were  to  be  developed,  It  would  certainly  be  the 
lightest,  least  complex,  and  least  disruptive  of  aerodynamics  of  any 
detector  approach.  The  other  detector  approach  shown  in  Fig.  9-2  uses  a 
lenslet  array  to  focus  the  light  In  each  subaperture  of  the  receiver 
onto  a  single  detector.  Although  heavier  and  more  complex  than  the 
conformal  detector,  this  detector  design  Is  achievable  with  current 
technology,  permits  greater  control  of  the  field  of  view  of  the 
receiver,  and  should  not  significantly  affect  aerodynamics.  Either 
detector  will  use  a  narrow-band  filter  to  eliminate  light  at  wavelengths 
other  than  that  of  the  laser  and  a  polarizer  to  select  only  a  single 
polarization.  It  should  be  noted  that  neither  detector  Is  affected  by 
turbulence  which  appears  as  a  phase  aberration  In  the  receiver  aperture. 
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FIGURE  9-2.  TRANSMITTING  AND  RECEIVING  OPTICS  FOR  REDUCED  TOLERANCE 
IMAGING. 


A  block  diagram  of  the  data  processing  architecture  Is  shown  In  Fig. 
9-3.  The  far-'f  1  eld  Intensity  data  from  the  receiver  Is  first  pre- 
processed  to  reduce  detector  noise,  background  bias,  fixed  pattern 
effects,  and  nonlinear  and  nonuniform  detector  response  effects.  The 
resulting  data  Is  then  Input  to  the  Iterative  Fourier  transform 
algorithm  along  with  the  Image  support  constraint  as  determined  by  the 
Illumination  pattern.  Within  the  Iterative  Fourier  transform  algorithm, 
the  error  reduction,  hybrid  Input-output,  and  enlarging  mask  options 
would  be  used.  The  image  reconstructed  by  the  algorithm  could  be 
postprocessed  to  reduce  remaining  noise  or  to  reduce  speckle,  If 
desired. 

The  hardware  required  to  perform  this  data  processing  will  vary  with 
the  time  allowed  to  produce  the  reconstructed  Image.  The  dominant 
computational  operation  Is  the  fast  Fourier  transform  (FFT)  required  by 
the  Iterative  algorithm.  To  reconstruct  an  N  by  N  Image,  a  2N  by  2N  FFT 
Is  required  which  In  turn  requires  8N2log2N  floating  point  multiplies. 
There  are  2  FFTs  per  Iteration,  so,  for  a  reconstruction  requiring  M 
Iterations,  the  number  of  multiplies  required  Is  16MN21og2N.  The  other 
computations  required  (such  as  constraint  enforcement,  pre-  and  post¬ 
processing)  are  small  In  comparison,  For  M»100  and  N»128,  the  total 
number  of  multiplies  required  for  a  reconstruction  Is  about  184  x  I0b, 
Current  or  near-term  single  VHSIC  chips  should  achieve  40  megaflop 
computation  rates,  so  a  small  special-purpose  processor  utilizing  as  few 
as  5  VHSIC  chips  could  reconstruct  a  128  by  128  Image  In  less  than  a 
second, 

9.4  SYSTEM  DESIGN  PARAMETERS 

This  section  lists  the  parameters  which  must  be  determined  In  a 
system  design  and  derives  important  relationships  between  these 
parameters  and  the  system  performance  parameters  defined  In  Sec.  9.2. 
The  key  parameters  In  system  design  are 
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FIGURE  9-3.  DATA  PROCESSING  ARCHITECTURE. 
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Transmitter: 

User  wavelength,  X 

Laser  pulse  energy,  Et 

Laser  pulse  length,  Ar 

Laser  spatial  coherence 

Laser  temporal  coherence 

Transmitter  aperture  diameter,  Dt 

Transmitter  optical  efficiency,  tt 


Optical  path: 

Two-way  path  transmittance,  exp(«2aR) 


Target! 


Target  range,  R 
Target  reflectivity,  r 
Illumination  pattern  taper 

In  Image  resolution  elements,  n 


Receiver: 


Receiver  aperture  diameter,  Dr 
Receiver  optical  transmittance,  tr 
Number  of  detector  elements,  2N  by  2N 
Detector  quantum  efficiency,  t) 

Detector  thermal  and  readout  noise 

standard  deviation  In  electrons,  nt 


a 


9.4.1  ILLUMINATION  PATTERN  TAPER 


One  Important  relationship  between  some  of  these  parameters  concerns 
Illumination  pattern  taper.  The  size  of  a  resolution  element  In  the 
illumination  pattern  at  the  target  Is  approximately  1„5XR/Dt.  The 
factor  of  1.5  Is  present  because  the  aperture  Is  apodlzed  to  reduce 


sldelobes.  Since  the  resolution,  d,  of  the  Image  (measured  at  the 
object)  Is  given  by 

d  -  XR/Dr, 

the  taper,  n,  In  Image  resolution  elements  Is 

n  ■  1.5Dr/Dt  • 

It  Is  desirable  for  the  taper  to  be  small  since  then  the  Image 
reconstruction  will  have  lower  error  for  a  given  measurement  slgnal-to- 
nolse  ratio  (SNR).  It  Is  also  desirable  for  the  transmitter  aperture  to 
be  small  since  then  the  transmitter  optics  will  be  lighter,  less 
disruptive  of  aerodynamics,  and  less  affected  by  turbulence.  For  a 
fixed  receiver  aperture,  the  above  relationship  Indicates  that  a  design 
tradeoff  must  be  made  between  taper  and  transmitter  aperture. 

9.4.2  DETECTOR  ELEMENT  COLLECTION  AREA 

Because  the  far-fleld  Intensity  to  be  measured  has  twice  the 
bandwidth  of  the  far-fleld  complex  amplitude,  the  number  of  measurements 
required  Is  2N  by  2N  to  enable  reconstruction  of  an  N  by  N  Image.  The 
far-fleld  Intensity  Is  a  speckle  pattern  with  each  speckle  approximately 
of  diameter  XR/Nd  where  Nd  Is  the  diameter  of  the  Illuminated  target 
region.  To  achieve  the  necessary  Nyqulst  sampling  of  the  speckle 
pattern,  the  sample  spacing  must  be  equal  to  half  the  speckle  diameter. 
Each  detector  element,  whether  of  the  conformal  or  lenslet  array  type, 
must  therefore  collect  the  light  over  a  region  no  larger  In  diameter 
than  XR/2Nd. 

If  it  Is  assumed  that  the  target  region  scatters  light  uniformly 
over  2r  steradlans,  then  the  fraction  of  the  reflected  light  collected 
by  each  detector  element  (collection  efficiency)  Is  given  by  the  ratio 


2 

of  the  collection  area  (of  a  detector  element)  to  2*R  .  Using  the 
diameter  XR/2Nd  given  In  the  previous  paragraph,  the  result,  assuming  a 
circular  collection  area,  Is 

collection  efficiency  ■  (X/Nd) 2/32  . 

It  Is  Important  to  note  that  although  collection  efficiency  does  depend 
on  the  size  of  the  region  to  be  Imaged,  It  does  not  explicitly  depend  on 
target  range  as  long  as  the  detector  elements  are  sized  according  to  the 
range  to  the  target. 

9.4.3  RECEIVED  ENERGY 

The  optical  energy,  Er,  received  by  a  single  detector  element  is  the 
product  of  the  laser  pulse  energy,  the  transmitter  optical  efficiency, 
the  two-way  path  transmittance,  the  target  reflectivity,  the  detector 
collection  efficiency,  and  the  receiver  optical  efficiency.  Using  the 
notation  defined  at  the  beginning  of  Sec.  9.4,  the  received  optical 
energy  Is 


Er  ■  Et  tt  tr  r(X/Nd)2exp(-2aR)/32  . 

Given  that  the  parameters  tt,  tf,  r,  a,  and  R  are  either  determined  by 
the  application  or  will  be  made  as  large  as  possible,  this  equation 
states  that,  to  achieve  a  desired  Ef  (and  therefore  a  desired  SNR  and 
RMS  Image  error),  a  tradeoff  must  be  made  between  laser  pulse  energy, 
Et,  and  the  area,  (Nd)  ,  of  the  region  to  be  Imaged. 
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9.4.4  MEASUREMENT  SIGNAL-TO-NOISE  RATIO 

Assuming  that  the  measured  data  can  be  corrected  for  any  background 
bias,  fixed  pattern  effects,  and  nonlinear  and  nonuniform  detector 
response  effects,  then  the  primary  remaining  degredatlons  of  the  data 
will  be  due  to  the  Poisson  statistics  of  the  received  photons  and  the 
thermal  noise  of  the  detector.  The  mean  number  of  signal 
photoelectrons,  ns,  generated  per  detector  element  by  the  received 
energy  Is 


ns  -  »?Er/ (hc/X) 

where  n  Is  the  detector  quantum  efficiency;  the  Planck  constant,  h, 

-34  8 

equals  6.63  x  10  Joule  sec;  and  the  speed  of  light,  c,  equals  3  x  10 

m/sec.  The  standard  deviation,  np,  of  the  number  of  photoelectrons,  for 

Poisson  statistics,  Is 


n 


P 


The  SNR  of  the  detected  far-fleld  Intensity  data  Is 


SNR 


n 


s 


where  nt  Is  the  standard  deviation  of  the  detector  thermal  and  readout 
noise.  Depending  on  the  wavelength  region  and  the  detector  used,  either 
Poisson  or  thermal  noise  may  dominate  the  denominator  of  this 
expression. 


9.4.5  LASER  PULSE  LENGTH  AND  COHERENCE 


The  ability  to  reconstruct  an  Image  from  far-fleld  Intensity 
measurements  depends  on  the  far-fleld  complex  amplitude  accurately 
representing  the  Fourier  transform  of  the  complex-valued  reflectivity  of 


the  Illuminated  region.  This  In  turn  Implies  that  the  laser  spatial 
coherence  must  equal  or  exceed  the  width  of  the  Illuminated  region  and 
that  the  temporal  coherence  must  equal  or  exceed  the  depth  (along  the 
1 1 ne-of-sl ght  from  the  Imaging  system)  of  the  region.  For  the  case  of  a 
target  located  on  the  ground,  the  depth  of  the  Illuminated  region  will 
depend  on  the  slant  angle  of  the  Illumination.  The  temporal  coherence 
of  the  laser  will  therefore  determine  the  maximum  Illumination  angle 
relative  to  the  normal  to  the  ground  plane. 

In  addition  to  sufficient  coherence,  for  a  pulsed  laser,  the  pulse 
length  must  be  much  greater  than  the  depth  of  the  Illuminated  region. 
This  requirement  ensures  that,  for  most  of  the  pulse  length,  light  will 
be  arriving  at  the  receiver  from  all  parts  of  the  target  region.  If 
this  Is  not  the  case,  then,  as  mentioned  In  the  previous  paragraph,  the 
far-fleld  complex  amplitude  will  not  represent  a  Fourier  transform  of 
the  target. 

9.5  NUMERICAL  EXAMPLE  OF  A  SYSTEM  DESIGN 

For  a  numerical  example  of  a  system  design,  the  system  performance 
parameters  were  chosen  to  be 

Resolution,  d  *  0.1  m 
Number  of  resolution  elements 

|  In  Image,  N  by  N  ■  128  by  128 

j  RMS  error  In  the  Image,  e  ■  10*  . 

i 

i 

The  following  system  design  parameters  were  chosen  or  assumed: 
Transmitter: 

Laser  wavelength,  X  ■  10.6  fi m 
Laser  pulse  length,  Ar  -  1  n sec 
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Laser  spatial  and  temporal  coherence  >  100  m 

(these  requirements,  which  are  easily  achieved 
by  a  COg  laser,  assure  that  a  cube  12.8  meters 
on  a  side  can  be  Imaged) 

Transmitter  optical  efficiency,  tt  ■  0.5 


Target: 

Target  range,  R  ■  10  km 
Target  reflectivity,  r  ■  0.1 
Illumination  pattern  taper 

In  Image  resolution  elements,  n  ■  6 

Optical  path: 

Two-way  path  transmittance,  exp (-2)  «  0,135 
(a  ■  O.l/km) 

Receiver: 

Receiver  optical  efficiency,  tr  *  0.5 
Number  of  detector  elements,  256  by  256 
Detector  type:  lens  let  array 
Detector  quantum  efficiency,  rj  *  0.5 
Detector  0*  ■  1  x  1011  cmCHzJ^/Watt 

(HgCdTe  at  77*K  with  an  f/4  lenslet  array) 

From  the  equations  derived  earlier,  the  following  dependent 
parameters  are  then  determined: 

Transmitter  aperture  diameter,  Dt  ■  0.26  m 
Receiver  aperture  diameter,  Dr  ■  1.06  m 
Detector  collection  area  diameter,  Dr/2N  ■  4  mm  . 

For  an  Image  error  of  10%  and  an  Illumination  pattern  taper  of  6  Image 
resolution  elements,  the  required  SNR  In  the  far-fleld  Intensity  data 
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has  been  shown  by  computer  simulations  to  about  10  to  1.  For  Infrared 
detectors,  the  dominant  noise  Is  thermal.  Using  the  definition  of  D  , 
the  noise  equivalent  received  energy,  En,  Is 

E 

Assuming  the  detector  width,  Aw,  Is  50  /»m,  then  En  -  3.5  x  10“17  Joules. 
This  corresponds  to  a  thermal  noise  standard  deviation,  n^.,  of  940 
electrons.  The  laser  pulse  energy,  Et,  required  to  produce  a  mean 
signal  10  times  as  large  (for  a  SNR  of  10  to  1)  Is  5  Joules.  The 
standard  deviation,  np,  of  the  mean  signal  due  to  Poisson  statistics  Is 
100  electrons  which,  as  expected,  Is  much  smaller  than  nt. 

9.6  CONCLUSION 

The  practicality  of  the  reduced  tolerance  Imaging  concept  for 


tactical  air-to-ground  or  ground-to-air  Imaging  using  active  laser 
Illumination  has  been  Investigated.  The  key  system  performance 
parameters  were  defined  and  transmitter,  receiver,  and  data  processing 
algorithm  architectures  were  established.  Relationships  were  determined 
between  these  performance  parameters  and  system,  target,  and  optical 
path  parameters.  The  main  design  tradeoffs  Identified  were  transmitter 
aperture  diameter  versus  Illumination  pattern  taper,  and  laser  pulse 
energy  versus  the  size  of  the  region  to  be  Imaged.  The  practicality  of 
an  Imaging  system  at  10.6  /im  was  explored  through  computation  of 
numerical  values  for  the  system  parameters  using  realistic  values  for 
system  performance,  optical  path  and  target  characteristics,  and 
infrared  detector  performance.  For  the  example  given,  the  largest 
coherent  aperture  required  (for  the  transmitter)  was  only  one-fourth  the 
diameter  (one-sixteenth  the  area)  of  the  coherent  aperture  we  would  have 
needed  to  form  a  diffraction-limited  Image.  Based  on  this  analysis, 
reduced  tolerance  Imaging  continues  to  offer  the  possibility  of  reducing 


cost,  weight,  complexity,  and  aerodynamic  effect  of  the  Imaging  system 
and  also  the  poss 1 bl 111 ty  of  diffraction-limited  Imaging  through 
turbulence. 
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Appendix  A 


PARAMETER  ESTIMATION  AND  THE  CRAMER-RAO  LOWER  BOUND 

A.l  INTRODUCTION 

A.1.1  The  Estimation  Problem 

The  problem  we  consider  is  that  of  estimating  a  parameter  vector 
&  —  [au  .  . . ,  aL)  from  a  measurement  vector  R  —  (R i,  .  .  .  ,  R^).  An  estimate 
is  a  random  vector  A  —  (A  t, . .  . ,  A^,),  where  each  A  ,•  is  intended  as  an  approx¬ 
imation  to  the  corresponding  a^.  An  estimator  is  a  function  A(‘)  mapping  meas¬ 
urements  into  parameter  values.  The  estimator  A(‘)  produces  the  estimate 
A  —  A  (R)  of  &  •  It  is  assumed  that  the  conditional  probability  density  p[r.  Id) 
is  known. 

A  word  on  notation  is  in  order.  In  general,  we  will  use  upper  case  letters  to 
denote  random  variables  and  vectors  and  will  use  lower  case  letters  to  denote 
their  possible  values  and  also  most  non-random  variables  and  vectors.  Vectors 
will  be  underscored.  Thus,  the  received  vector,  which  contains  randomness,  is 
denoted  R.  A  function  of  a  random  quantity  is  also  to  be  considered  random; 
hence,  the  estimate  A  is  capitalized.  As  described  below,  the  parameter  vector 
to  be  estimated  may  be  random  or  not.  In  the  previous  paragraph,  we  elected  to 
use  the  lower  case  a  •  The  probability  density  of  a  random  vector  such  as  R  will 
be  denoted  or  simply  p(z),  when  no  confusion  can  arise.  A  conditional 
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probability  density  such  as  that  of  &  given  A  will  be  denoted  Pg  14(11  ji)  or 
pit  |  i )•  The  average  of  a  random  quantity  will  be  denoted  either  E ]  or 
2J,t[aj.  The  latter  is  especially  helpful  for  expressions  like  EA  (dp^  (a  )/da  ],  which 
otherwise  would  have  to  be  written  E[dpA(A  )/dA  |. 

We  consider  two  distinct  estimation  environments  that  differ  primarily  in 
their  prior  knowledge  of  the  parameter  vector  1  to  be  estimated. 


Environment  Is  Nonrandom  Parameters 


Here,  the  parameter  vector  to  be  estimated  is  a  fixed  but  unknown  vector  4 , 
and  the  quality  of  an  estimator  X  (•)  is  judged  by  the  resulting  mean  squared 
error  functions 


-«<)*],  »  -1,2 . 


(A-l.l) 


As  these  mean  squared  errors  ordinarily  depend  on  the  actual  values  of  the 
parameters  4,  there  is  not  usually  a  “best”  estimator.  Rather  some  estimators 
are  better  for  certain  values  of  a  and  worse  for  others. 

A  frequently  used  estimator  for  this  environment  is  the  maximum  likelihood 
(ML)  estimator,  which  chooses  A  (1)  —  4  if  p(l  U)  is  largest  among  all  choices 
for  jj. 

An  estimator  is  unbiased  if  E  [;$  i  ]  —  a, ,  i  =  1,  .  .  . ,  L .  The  ML  estimator 
might  or  might  not  be  unbiased.  In  some  special  cases  there  exists  an  unbiased 


—  "  >  lT— J - 


estimator  A  V)  that  is  best  in  the  sense  that  for  any  other  estimator  A  , 


$tl>- £1(1  *(£)-■<)  “  1  <S.U)~£[U  (£)-«, )Z  I  ,  for  all*  and  i  -  1,  ... 

(A- 1.2) 


Environment  2s  Random  Parameters 

Here,  the  parameter  vector  to  be  estimated  is  a  random  vector  A  with 
known  probability  density  p(a).  The  quality  of  an  estimator  is  judged  by  the 
mean  squared  errors 

Si  -  E  {(A  ;  -  A<)3]  ,  i'  -  1,  . .  . ,  L  .  (A-1.3) 

In  this  environment,  there  is  a  best  estimator,  namely,  the  mmimtim  mean 
squared  error  (MMSE)  estimator: 

Ifi  -£]  •  (A-1.4) 

An  estimator  in  Environment  2  is  said  to  be  unbiased  if  E  [A ,  j  *«  /?[/!,], 
i  =  1 The  MMSE  estimator  is  unbiased. 

A.1.2.  The  Cramer- Rao  Lower  Bound 


In  either  environment,  there  is  a  Cramer-Rao  lower  bound  (CRLB)  to  the 


Environment  It  For  any  unbiased  estimator, 


*((*,  “*i)2l  >  Mi1  ,  i  -  I,--  -,L  (A-1.5) 

where  J  is  the  L  XL  matrix,  sometimes  called  the  Fisher  information  matrix, 
with 


</,y  “  Ei 
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^  lnp(£  U)^g7  InPU  U) 


(A-1.6a) 
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da,  day 


lnp(l  U) 


(A-1.0b) 


and  where  the  expectations  average  over  R  ■  The  Cramer-Rao  bound  also  gives 
the  more  complete  result 


C>/-‘  ,  (A-1.7) 

where  C  —  [<7<y)‘is  the  covariance  matrix  of  the  errors  (Cty  —  E  [(A  ,  -a<  y -ay )] 
and  where  C>7'1  means  that  C~J~l  is  a  non-negative  definite  matrix.  (A 
matrix  M  is  non-negative  definite  if  i1  Mi>Q,  for  every  column  vector  i).  Note 
that  C  >JA  implies  <?,,  >[/]£*,  so  (A-1.7)  subsumes  (A-1.5). 


Environment  2:  For  estimators  having  a  certain  “unbiaaed'like” property, 

B[{Ai-Aif\Z[7  +  K\iil,  i~l,...,L  ,  (A-1.8) 

where 

li  -EM I  (A-1.8) 

and 
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Kij  -  Es 


(A- 1.10a) 
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in  p  U ) 


(A-l.lOb) 


In  this  environment,  the  expectations  are  average  over  &  and  A*  Id  addition 
there  is  also  the  more  complete  result: 


(A-l.ll) 

where  C  «*  [C,y]  is  the  covariance  matrix  of  the  errors, 
( C\j  **  E[{A  i-AiXA  j-Aj)\).  In  the  scalar  case  (i.e.,  L«l)  the  “unbiased-like” 
property  which  the  estimator  must  satisfy  is 


4limooPA(fl)(£?fl|^(x)M,-« ]-e)-0  .  (A-1.12) 

Notes: 

(1)  The  existence  of  the  above  derivatives  is  presumed. 

(2)  The  matrices  J  and  K  are  functions  of  the  likelihood  distribution 
p(l  U)  and  the  a  priori  distribution  p(j),  respectively.  J  is  related  primarily  to 
the  likelihood  distribution  p (i  |a)  and  secondarily  to  the  a  priori  distribution 
PU). 
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A.2.  SCALAR  PARAMETERS 


A.2.1.  In  this  section  we  consider  the  case  where  the  parameter  to  be 
estimated  is  a  scalar  A  (i.e.,  L  **  1).  The  measurements  may  be  a  vector  or  a 
scalar.  In  this  case  the  Cramer-Rao  lower  bounds  simplify  as  shown  below. 


Environment  It 


where 


(A-2.1) 


-  Er  lfl  P (x  I  * )  “  [  ( Jaln^(£  I  *))  ]  *  (A-2.2) 


Environment  2t 


B[(A  ~A)2\> 
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where 


J  -  Ea  [/)  -  -  ERiA  j-, ^In  p(i  |  «)  -  JS&,a  (jjn  p(i  |  a)j  (A-2.4) 
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8s 

h 


-Ba  [  (-£l“P(>))  ]  -*A  [  Tj  )  ]  • 


(A-2.6) 


where  pfA  (a )  denotes  the  derivative  of  p*  (« )  with  respect  to  a . 
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Note  that  if  in  Environment  2  one  has  to  estimate  a  parameter  vector 
A.  —  {A  i  Al  )  from  a  measurement  vector  fi ,  one  may  separately  apply  the 
bound  in  (A-2.3)  to  each  component  of  A  and  obtain 


Etft  -■*/)’]>  +  {■£■  eW 
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-» 


(A-2.6) 


This  bound  is  presumably  simpler  to  compute  but  not  as  good  as  that  of  Eq.  (A* 
1.7). 


A.2.2  Additive  noise  problems)  Calculation  of  J 


•  ni 


(a)  Suppose 

R  -  /  (A  )  +  N  ,  (A-2.7) 

where  /  is  some  function  and  N  is  some  noise  that  is  independent  of  A  with 
density  pw(n ).  Then  for  estimating  A  from  R , 

■  <A-3'8> 

where  p'/v(«)  and  /'  (a)  denote  the  derivatives  of  p/v(»)  and  /(a)  with  respect 
to  n  and  a ,  respectively. 


(b)  Suppose  (A-2.7)  holds  and  N  is  Gaussian  with  mean  mN  and  variance 
0N\  >•<*•»  iitnN,ifN)>  Then 
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J 


(A-2.9) 


~  -1  (/  '  (a))2 
*N 

(c)  Suppose  (A-2.7)  holds  with  N  Gaussian  i i(mN,a^)  and  with 


/(a)  =  ca+d  .  (A-2.10) 

Then 

J  ““  A  (A-2.11) 

Notice  that  J  does  not  depend  on  a .  For  this  ease  the  ML  estimator  is 

■?(')■“ -^r  ■  (A-2.12) 

c 

which  is  unbiased  and  results  in  mean  squared  error 

E  [(^(tfWJ2]--^  ,  (A-2.13) 

°N 

which  in  turn  coincides  with  1/ J,  the  Cramer*Rao  lower  bound  to  mean  squared 
error.  Thus,  we  draw  two  conclusions:  the  ML  estimator  is  optimum  in  the  sense 
that  no  unbiased  estimator  has  smaller  mean  squared  error,  and  the  Cramer-Rao 
bound  is  tight 

A.2.3  Environment  2:  Calculation  of  K 

If  A  is  GaussiftD  timA  ,trj[  j,  then 


K 


(A-2.14) 
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A.2.4  Linear  Gaussian  Problem:  Environment  2 

Suppose 

R  =*  cA  +  d  +  N  ,  (A-2.15) 

where  A  and  N  are  independent,  Gaussian,  n(mA  ,(t\  )  and  i K  »»>/,»£),  respectively. 
In  this  situation,  the  Cramer-'Rao  lower  bound  again  turns  out  to  be  tight,  i.e.,  it 
equals  the  mean  squared  error  of  the  best  estimator.  Specifically,  the  best  esti¬ 
mator  A  *(R )  *=  £?[.4  |  R 1  turns  out  to  be  the  ML  estimator 

A  *(*)«•  ,  (A-2.16) 

C 

and  the  resulting  mean  squared  error  is 
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A.3  INDEPENDENT  MEASUREMENTS*  THE  J-MATRDC 


A.3.1  Conditionally  Independent  Measurements 

Let  us  consider  the  commonly  occurring  situation  wherein  the  measurements 
Ru  . . . ,  Rm  are  conditionally  independent  given  the  parameter  vector  That 
is, 

M 

pr\a(l  li)  -  II  Pk(n  U)  .  (A-3,1) 

i-i 

where  for  brevity  we  write  p*(r*  |  j)  for  p ^  j^(r*  |.a).  We  may  think  of  each 
as  an  independent  measurement  of  A*  For  example,  such  conditional  indepen¬ 
dence  holds  if 

Rk  ~  fk(A  ,Nk  )  ,  *  ,  (A-3,2) 

where  /  „  .  ,  .  t  / M  are  known  functions  and  Nv  ,  .  . ,  NM  are  random  variables 
that  are  independent  of  each  other  and  of  A  • 

Then  as  shown  in  Section  A.3.3,  the  J-matrix  in  the  Cramer-Rao  bound  for 
estimating  A  based  on  fl  is 

J  “  £  J(k)  ,  (A-3.3) 

t-i 

where  is  the  y-matrix  for  the  Cramer-Rao  lower  bound  to  the  MSE  for 
estimating  A  based  on  /?* ;  that  is, 
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I  I 

«J,|* }  -  ERi  ~  In  p*  (fi  U )  ~-  In  p*  (rA  U )  .  (A-3.4) 

Equation  (A-3.3)  shows  that  each  independent  measurement  R i  has  an  additive 
effect  on  J .  The  K -matrix  (in  Environment  2)  has  the  usual  form. 


One  specific  example  occurs  frequently,  namely, 

“  /*  (A )  +  A i  ,  k  —  1, . . . ,  M  , 


(A-3.5) 


where  /  „  .  .  . ,  /  w  are  known  functions  and  Ny, . . . ,  NM  are  independent  of  A 


and  each  other.  Then  using  the  fact  that 


one  finds 


Pki'k  U)-Pty(»W*U))  . 


•  8’ 

it  \  P  A4  ( ^k )  $  »  t  \  d  ,  ,  . 

|t5T57]  k; /,(4);k; /,u) 

•  « 


(A-3.0) 


(A-3.7) 


If  in  addition  Nk  is  Gaussian  then 


4"  -4  /*<*)—  /.a)  • 


A.3.2  One-for-one  Independent  Measurement* 


(A-3.8) 


Let  us  here  consider  the  situation,  henceforth  referred  to  as  ont'fot'ont 
independent  measurements,  where  there  are  equal  numbers  of  measurements  and 
parameters  to  estimate  (i.o.,  L  **M)  and  where  in  addition  to  (A*3.1)  we  have 


Mb 


n  Pi(^l«»)  »  (A-3.«a) 

k  -«1  j 

j 

f 

or  equivalently,  J 

i 

| 

j 

Pnk  U  0 rk  I A )  =  Pk  (r*  I  «* ) .  (A-3.9b) 

! 

i 

A  ’j 

where  p*(r*  |  a*)  —  ( ( r*  |  <r4)  —  pffl  |A(r4  |jj).  That  is,  each  measurement 

/?*  depends  only  on  the  corresponding  parameter  Ak .  Therefore,  we  may  think 

I 

of  each  Rj,  as  an  independent  measurement  of  A 4 .  This  situation  occurs,  for 
example,  when 

j 

i 

Rk  ,  h  ,  (A-3.10) 

1 

for  some  functions  /w  and  random  variables  Nx,  , ,  .  ,NM  that  are 

i 

independent  of  each  other  and  of  A* 

Environment  1. 

In  the  case  of  one-for-one  independent  measurements  in  Environment  1,  each 
parameter  Ak  may  without  loss  of  optimality  be  estimated  from  Rk  alone;  i.e., 
without  using  Rj,jyikt  and  without  regard  to  the  joint  distribution  of 
(A  j,..., Ay),  We  will  now  show  that  the  Cramer-Rao  lower  bound  reflects  this 
fact  by  reducing  to  the  scalar  lower  bound. 

It  follows  from  (A-3.3),  (A-3.4)  and  (A-3.9b)  that  «•  0  unless 
1  r«  j  r.  k .  Hence,  J  is  diagonal.  Specifically, 
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.  <v '  uv  yv w ».■%  «wt<y«  «j  i/vi.**  yy  v*  ;■»  nm/y  wijjigi  ><yn,WV'.Jy  i «. 


0 


(A- 3.11) 


where 


jkk  mm  jtf)  =-  J  In  pk(rk  |  A|)J  J  .  (A-3.12) 


Substituting  (A-3.11)  into  (A-1.5)  shown 


(A-3.13) 


which  is  exactly  the  same  bound  that  one  would  get  by  applying  the  scalar  lower 
bound  (A-2.1)  to  the  problem  of  estimating  Ak  from  R k  alone. 

One  commonly  occurring  instance  of  one-for-one  independent  measurements 


is  where 


Rk  -/*(**)  +  A*  ,  *  , 


(A-3.14) 


where  the  Fk  's  are  known  functions  and  the  A*  ’s  are  independent  of  each  other 


and  of  A, .  In  this  cose 


Pki'k  I  «* )  —  PAT* ( »•*  -  /*(«*))  • 


(A-3.15) 


Substituting  the  above  into  (A-3.12),  gives  (alternatively,  it  follows  from  (A-2.8)) 


******  **  **  *****  VWM4A**  MK*AftAW\  ** 


4k  *“ 


2" 

• 

PNk(nk) 

« 

(/'*(«*))' 


(A-3.18) 


Finally,  if  the  Nk ’s  are  Gaussian  then 


4  =4  (/'*(«*))’ 

ffNk 


(A-3.17) 


Environment  2 


Consider  the  situation  of  one-for-one  independent  measurements  in  Environ¬ 
ment  2.  If  in  addition  the  components  of  A  “  [Aj,...,.<4w]  are  independent,  then 
without  loss  of  optimality  one  may  estimate  each  Ak  from  Rk  alone.  That  is  the 
optimum  estimate  of  Ak  is 

Ak  -  E[Ak  |fl]  -  E[Ak  \Rk\  .  (A-3.18) 

We  now  show  that  the  Cramer-Rao  lower  bound  reflects  this  fact  by  reducing  to 
the  scalar  lower  bound. 

It  follows  from  (A-3.11)  that  7  =  EA  [7]  is  a  diagonal  matrix  with 

4k  -  lnP(r*  I  «*))  J  (A-3.19) 

In  addition,  the  independence  of  imply  that  th*»  K  matrix  is  diagonal 

with 

Kkk  =  BAi  ^  lnp(at)j  |  .  (A-3.20) 

Thus  [7  +  K]  and  [7  +  A)"1  are  diagonal  matrices  and  one  obtains  the  Cramer- 


| 
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Rao  bwer  bound 


E[  I'1*-'4*)’]  2  [?  +  «]„  (A-3.21) 

=  ***  [  («|  P(,‘  1  "  >)  ]  +  **  [  ln  p (a* > )  ■ 
which  is  exactly  the  same  bound  that  one  would  get  by  applying  the  scalar  lower 

bound  (A*2.3)  to  the  problem  of  estimating  from  /?*  alone. 

If,  however,  the  components  of  A  are  dependent ,  then  the  best  estimate  of 
each  At,  will  depend  on  other  Rj's  besides  /?*.  Intuitively,  we  may  think  that 
Rj ,  j^Ak ,  contains  information  about  Aj,  which  in  turn  has  information  about 
Ak .  This  dependence  is  reflected  in  the  Cramer-Rao  lower  bound.  Specifically, 
the  matrix  K  and,  consequently,  [J  +  K]  will  not  be  diagonal.  Note,  however, 
that  7  will  be  diagonal,  and  this  sometimes  allows  some  simplification. 

A.3 .3  Proof  of  ( A-3 .3) 

Recall  that 


Jii  =  E 


■g-  In  p(fl  U)~  la  P(fl  U) 


Using  (A-3.1),  we  find 


J-  In  P(£E  la)  =  -57  E  U)=  E  . 

9<li  dtli  *- 1  i-1 


where  K,  *  is  the  random  variable 


(A-3.22) 


(A-3.23) 


Yi,k  \*Pk(Rk  U)  . 


(A-3.24) 


The  assumption  o?  conditionally  independent  measurements  implies  that  y(i*  ia 
independent  of  Yjj,  if  k  ^  L  Substituting  (A-3.23)  into  (A-3.22)  and  using  the 
independence  of  the  K(  || ’s  gives 

N  N  N  N 

4  -  *  E  1'i.i  E  rt>  -EE  b[k„,  !'/,<] 

i -i  r-i  i -i  i-i 


(A-3.25) 


-  E  !'>,*!+  E  E  . 

A- 1  A-i  /-I 

Vi 

We  now  show  that  the  second  term  in  the  above  is  zero.  For  any  i  ,k 

r  Pk(r  I  a) 

E[Yi,k\**f  -g-  InPA(rU)  ft(r|l)dr-J  ““~p“("r~UT"  Pl^r  I 

■  ■  ( A.9 


(A-3.26) 


/■^  Pk^r  “  jz.  Ip*(r  I *)ir 


Substituting  the  above  result  into  (A-3.25)  gives 


4  -  E  Bin.. 4,. I  -  £  b[—  top, (ft,  |.)^  lap, (ft,  U) 

k  -1  A-l  »«»  0°/ 


(A-3.27) 


-  E  4“’  ■ 

A-l 


where  /(*)  is  defined  to  be  the  /-matrix  for  estimating  a  based  on  /?* .  This 

Af 

shows  that  /  —  £  Z1*1,  which  b  the  desired  result  (A-3.3). 


A.4  ADDITIVE  GAUSSIAN  NOISE 


A.4.1  Environment  1 


Suppose 


fi-/(A)+tf  ,  (A-4.1) 

where  /  is  a  known  vector-valued  function  and  —  ( Nx , . .  . ,  N^)  is  a  vector 

of  Gaussian  noise,  independent  of  At  with  zero  means  and  covariance  matrix  CN, 

CN  “  E[M'\ ,  \Os\tf  -  EiNiNj]  .  (A-4.2) 

In  this  cose 

Pfl  IdU  U)“  Puit-f  U)) 

“  “P  (' i11'7 U))' Cn’I(w  H  • 

Substituting  the  above  into  (A- 1.6a)  and  simplifying  yields 

J  -  rc^r*  ,  (A-4.4) 

whore  r  is  the  L  x  M  Jacobean  matrix  defined  by 

/>  Cfl )  •  (A-4.5) 

If  /  (i )  is  linear;  i.e.,  if 

&  -  Fi  +  N  ,  (A-4.6) 

where  F  is  an  L  X  M  matrix,  then  r  *«  Fl  and  so 
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J  -  F*  Cn'F  .  (A-4.7) 

Note  that,  just  as  in  the  linear  scalar  case  (A-2.ll),  J  does  not  depend  on  4. 
Also,  as  in  the  linear  scalar  case,  the  ML  estimator  is  unbiased  and  achieves  the 
Cramer-Rao  bound.  Consequently,  the  Cramer-Rao  bound  is  tight.  Specifically, 
the  ML  estimator  is 

i  U)  -  |F*  Cfi'Ff'F*  Cn'j 1  (A-4.8) 

and  has  error  covariance  matrix  C  —  J"1. 

If  Ni, .  . . ,  Sfu  are  independent,  then  (A-4.4)  reduces  to  the  answer  given 
previously  in  (A-3.3)  and  (A-3.8), 

If  N\,  .  .  . ,  are  identical  as  well  as  independent,  then  (A-4.4)  reduces  to 


and  (A-4.7)  reduces  to 


A.4.2  Environment  2 


(A-4.9) 


(A-4.10) 


Let  us  again  suppose  that  (A-4.1)  holds;  i.e.,  B.  *“  /  {A)  +  jY,  where  $  is 
Gaussian  with  zero  means  and  covariance  matrix  C/v,  and  in  addition,  suppose 
that  A1  —  {A  u  . .  .  ,  Ai )  is  Gaussian  with  zero  means  and  covariance  matrix  CA  . 
Then 
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A.5  COMPLEX  PARAMETERS  AND  MEASUREMENTS 


Id  this  section  we  discuss  the  situation  in  which  either  the  parameters  to  be 
estimated  or  the  measurements  or  both  are  '<  nplex  vectors.  We  begin  with 
some  notation. 

For  reasons  to  be  seen  later,  we  will  typically  represent  a  complex  vector 
such  os  i  =*  wherein  each  component  *y  is  complex,  by  a  vector  & 

with  2M  real-valued  components:  specifically, 

h  “  (*lr  i*  li  i*2f  i*2i  >*Mi )  i  (A-S.l) 

where  *yf  and  sy<  are  the  real  and  imaginary  parts,  respectively,  of  2y .  Alterna¬ 
tively,  we  may  write 

&  "■  (litis . 1m)  >  (A-5.2) 

where  g  ,  (*yr  ,*;i)  is  the  vector  representation  of  *  y  —  +wy< ).  We  will 
often  write  x}U  os  generic  notation  to  stand  for  either  or  zj{ ,  It  will  occasion¬ 
ally  be  convenient  to  flip  back  and  forth  between  the  two  notations  g  and  £, 

depending  on  what  needs  emphasis  and  which  requires  fewer  symbols.  For  exam¬ 
ple,  when  magnitude  or  length  are  relovant,  then  the  magnitude  |  2,  |  and  the 
length  ||iy  ||  both  denote  s/xJ+Tjf,  but  the  former  is  simpler  to  write. 
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Note  further  that  the  “lengths"  of  the  vectors  i  and  &  are  identical: 


A.6.1  Complex  Measurements 

When  the  measurements  form  a  complex  vector  £  —  (£  !,..,,/?*/),  we  will 
represent  them  as  a  real  vector  £  ■*  (ftir  ).  For  one  thing  this  allows 

there  to  be  a  well-defined  probability  density  p(&  |j). 

A.6.2  Estimating  Complex  Parameters 

When  the  parameters  to  be  estimated  form  a  complex  vector  A  “  A  . . AL ), 

we  will  separately  estimate  the  real  and  imaginary  components  of  each  Aj .  That 
is  wo  will  represent  A  hy  the  vector  —  (A  lr  2r )  with  real  com¬ 
ponents  and  produce  an  estimate  ^  “  (A ^u)  a=  (An  I2 . A/,)-  If  so 

desired,  A  can  be  converted  to  a  complex  estimate  A  *==  ^ )  for  A  . 

The  mean  squared  errors  in  A  are  related  to  those  in  A  by 

|  Aj  -A  j  | 2  —  ( Ajt -A  i  ,r  )  V  [Aji -A  ji  )  —  1 1  Aj  -A/  1 1 2  •  (A-5.4) 

Note  that  the  Cramer-Rao  bound  requires  differentiation  of  a  certain  func¬ 
tion  with  respect  to  the  parameters.  This  makes  sense  only  if  the  parameters  are 


real  and  provides  further  motivation  for  replacing  A  by  A.  Specifically,  the 
Cramer-Rao  bound  for  estimating  A  from  a  real  vector  R  is,  in  our  notation, 

Environment  1: 

)*]>M£U  ,  *  “  ,  u  €{r ,i }  ,  (A-B.5) 

where 

•  a 

4<>  -Er  —  In  PU  I  >“  P(l  1 1)  .  (A-5.6) 

»  a 

Environin'  nt  2 1 

ElUku-AtJ2!  >  [J+KliU  >h  -  1 . M,  «€{v,0  ,  (A-5.7) 

where 

*4«,/«  ""  ^*4,  [*4a,/»  ]  (A-5.8) 

»  a 

Kk a>  —  Ea  lnP(l)  •  (A-5.9) 

I  • 

A.6.3  Independent  Measurements 

Let  us  consider  the  commonly  occurring  situation  in  which  the  components 
of  the  complex  measurement  vector  R  =(RU...,RM)  are  conditionally  indepen¬ 
dent  given  the  real  parameter  vector  A  —  {A  it.,,,AL).  With  R  represented  by 
the  real  vector  R  —  (Rlr,Rli,...,Rl^)1  the  conditional  independence  means 


M 

(r,U)sa"45lp*U*  U)  *  (A-5.10) 

where  pk(L  k  I  & )  is  shorthand  for  p&i  (x  *  |  n ).  This  happens,  for  example,  if 

Rk=fk(A,Nk)  ,  . M  ,  (A-5.11) 

where  /t,.,.,/w  are  known  complex-valued  functions  and  are  complex¬ 

valued  random  variables  that  are  independent  of  each  other  and  of  A  •  The  con¬ 
ditional  independence  considered  here  is  but  a  slight  generalization  of  that  con¬ 
sidered  in  Section  A.3.1. 

Arguments  similar  to  those  used  in  Section  A, 3.3  show  that  the  J  matrix  of 
the  Cramer-Rao  lower  bound  to  the  MSB  when  estimating  A  based  on  &  reduces 

to 


u 


j  -  i  , 

4-1 


(A-5.12) 

where  is  the  J-matrix  for  estimatings  based  on  B.  i  ■=*  (/?*r )>  that  is, 


Jif* 1  =“  , 


In  Pk (l  k  U )  In  PkU  4  U ) 


(A-5.13) 


Equation  (A-5.2)  shows  that  the  effect  of  each  complex  measurement  is  additive. 

A.B.4  One-for-one  Independent  Measurements 

Let  us  now  consider  the  situation  in  which  there  are  M  complex  parameters 
A  —  (A  i,...,Am),  M  complex  measurements  R  «■*  and  each  measure¬ 

ment  depends  only  on  in  the  sense  that 
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or  equivalently, 


p(l  k  I  i)  -  PkU  k  U  *)  .  (A-5.14b) 

where  pk{, t  k  1 4  * )-  Pg  *  |  g  *  U  *  U  * )  -  Pfi  4 1  a  (t  *  I  a>  This  happens,  for 
example,  if 

*4  ,  (A-5.15) 

where  / 1,..*,/ a/  are  known  complex-valued  function  and  Nu..,,Nm  are  complex¬ 
valued  random  vectors  that  are  independent  of  each  other  and  of  A, 

This  cose  is  also  a  special  case  of  that  considered  in  Section  A.5.3.  Restating 
the  result  given  there  for  the  present  case  gives 

J  am  ^  J ,  (A-5,16) 

4-1 

where  J is  the  /-matrix  for  estimating  based  on  &  —  That  is, 

•  ■ 

-n—  la  PkU  k  1 1)  flT"  In  P*U  *  I  i)  . 

aamH  uant 

(A-5.17) 

m,ne{l AT > ,  u, «€{«■•}  . 

Substituting,  (A-5.14b)  into  the  above  gives  —0  unless  m  —  n  —  Jfe. 

Hcnco  /  is  tri-diagonal.  Specifically, 


r(l) 


'(2) 


0 


0 


J(M)  J 


(A-5.18) 


where  /(*)  is  the  2  X  2  matrix 


M 


ut» 


Ei 


R 


—  In  p(l  k  lnP(H  Ua) 

°aku  ®flA* 


,  «,w6{r,f }  . 

(A-5.19) 


Here,  J(k)  is  the  ^-matrix  in  the  Cramer-Rao  lower  bound  for  estimating 
based  on  B.  k  - 


A.0  CRAMER-RAO  BOUNDS  IN  THE  PRESENCE  OF 
AMBIGUITY 


Consider  the  additive  noise  estimation  problem  where 

fi-/(A)  +  tf  •  (A-6.1) 

When  /  is  not  one-to-one,  we  will  say  there  is  ambiguity  in  the  function  /  and 
the  measurement  R .  In  such  cases,  one  cannot  expect  any  estimator  to  perform 
well.  Unfortunately,  however,  the  Cramer-Rao  bound  may  not  reflect  this,  i.e,  it 
may  give  a  very  low  MSE  even  though  the  actual  MSE  is  rather  large.  We  illus¬ 
trate  this  phenomenon  in  the  scalar  case  and  show  that  it  is  not  a  problem  in  the 
linear  Gaussian  case. 
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Example  1:  Scalar  Parameters 

One  may  see  from  (A-2.8)  and,  for  the  Gaussian  case,  (A-2.9)  that  the  CELB 
'will  be  the  same  for  any  function  /  whose  derivative  has  the  same  magnitude  as 
that  of  /  .  To  get  a  clearer  picture,  consider  the  situation  where  /  has  a  con¬ 
tinuous  second  derivative  but  is  not  one-to-one.  Then  /  has  an  interval  where  it 
is  constant  (/  1  =»  0)  and/or  a  pair  of  intervals  /+  ,  l.  such  that  /  increases  on  /+ 
{f  1  >  0)  and  /  decreases  through  the  same  range  on  /.(/'<  0).  The  CRLB 
will  be  made  large  (and  appropriately  so)  by  the  intervals  where  /  '  =  0.  On  the 
other  hand,  it  will  not  be  affected  by  the  existence  of  pairs  of  intervals  where  / 
increases  and  decreases,  respectively,  through  the  same  range,  i.e.,  it  will  be  as 
small  as  the  CRLB  for  the  nonambiguous  function 

/(•)=/  l/'(«)l*  .  (A-6.2) 

-00 

for  which  one  expects  there  are  estimators  with  much  lower  MSE  than  for  /  . 
Thus,  we  conclude  that  in  the  presence  of  ambiguity,  the  CRLB  can  only  be 
expected  to  give  a  reasonable  lower  bound  to  the  MSE  for  the  estimation  problem 
involving  /  . 

As  a  concrete  example,  compare  the  estimation  of  A  based  on  either  but  not 
both  of  the  following  two  measurements 

Ri  =  /i(A)  +  N  ,  =  f^A)  +  N  ,  (A-6.3) 

where  /l(a)  =  a3,  /2(a)=  |  «3 1 ,  A  and  N  are  Gaussian,  independent  and 


»K°i  <*a)i  *K°>  °n\  respectively.  There  is  obvious  ambiguity  in  the  R2  measure¬ 
ment.  For  either  measurement,  the  CRLB  gives 


E[(A  -Af ]> 


_ 1 _ 

27 o\l  oft  +  1/  oft 


(A-6.4) 


This  is  a  reasonable  bound  for  estimation  based  on  R  (It  tends  to  0  as  oft  -*  0; 
it  tends  to  oft  as  oft  -* oo.)  It  is  not  reasonable  for  estimation  based  on  R2. 
Indeed  if  oft  —  0,  then  the  best  estimate  for  A  based  on  R2  is 
X  =  E  {A  1  R  2\  =  0  with  MSE  =■  o ft ,  whereas  the  CRLB  lower  bound  is  zero. 


Finally,  consider  estimation  based  on  R2)  when  the  density  of  A  is  replaced 


by 


^(<0 


a  >  0 


(A-6.5) 


( 0  ,  a  <0  , 

Now  the  increased  a  priori  information  removes  all  ambiguity  (at  least  with  pro¬ 
bability  1),  and  the  CRLB  (which  is  the  same  as  before)  is  a  reasonable  bound  to 
MSE  based  on  R2.  Thus,  we  see  that  additional  a  priori  information  can  modify 
the  problem  so  the  CRLB  is  useful. 

Example  2:  Linear  ID  Gaussian  Case 
Here, 
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B.-FA+&  ,  (A-8.6) 

where  R  -  (Rv  .  . . ,  fl*/)1.  A  —  (-4  u  .  . . ,  ^ )‘ ,  N  —  (Nlf .  .  .,WlV)‘,  F  is  an 
M  X  L  matrix,  and  A  and  #  are  IID  Gaussian  g(0,  <rjj)  and  »j(0,  <r$),  respectively 
and  independent  of  each  other.  As  mentioned  in  Section  A.4  the  CRLB  bound  is 
tight  and  gives 


EUi-Xi)2] 


Eli  +  J. 

*.4 


-1 


(A-6.7) 


If  F  has  rank  less  than  L  (for  example,  if  M  <  L ),  there  will  be  ambiguity  in  R . 
Nevertheless,  the  CRLB  is  tight. 


As  a  concrete  example,  suppose 


R  ■*»  A  i  +  A  2  +  N  , 
so  that  L  —  2,  A/  =  1,  F  —  [1  1],  Then 


(A-6.8) 


J+K  -  -L 


1  1 
1  1 


1  0 
0  1 


(A-6.9) 


[7+/f]-1  - 


9h  +  2crj 


*N  + 


*1  °N  *N 


(A-0.10) 


and  it  is  easy  to  show  that 


E[(Al-Al)2]^E\(A2-;,  2f) 


+  2<r  j 


(A-0.11) 


Note  that  as  <7$  — *  0,  MSE  -*  o\!  2,  and  as  <7$  -*  oo,  MSE  -*  . 
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Appendix  B 


A  GENERAL  APPROACH  TO  LOWER  BOUNDS 
TO  PHASE  RETRIEVAL  ERROR 

Introduction  to  the  Estimation  Problem 

The  problem  is  to  estimate  a  complex-valued  image  g  from  measurements  of 
its  Fourier  transform  in  the  presence  of  additive  noise  and  random  phase.  The 
goal  of  this  section  is  to  compute  lower  bounds  to  the  least  possible  mean  squared 
error  in  such  estimates.  Specifically,  the  image  is  a  complex-valued  m  X  m 
array 

9  —  [0,-  P  »  (P i>Pi) ,  Pi  “  0,  1, ... ,  ro-1  ,  p2  —  0,  I, .  .  . ,  m-1] 

For  conciseness  let  A/  {0,  1,  . . . ,  m-1},  and  let  A/a  —  M  X  M,  so 
0  —  [g, :  P  6  A/2].  The  image  g  is  weighted  by  the  elements  of  a  real  valued 
non-negative  weighting  array  w  =>  [i r,:p.6M!],  forming  /  =*[/,:  p  6  M2\ 
according  to 

•  (B-l) 

The  special  case  in  which  the  wf  ’s  are  0  or  I  is  of  particular  interest. 

The  measurements  upon  which  the  estimates  of  g  are  based  form  a 
complex-valued  array  5  «■  \Sk :  k  —  (kuk2)  6  A/2],  where 

,  (B-2) 

where  N  *=»  [Nk :  A  g  A/2]  is  an  array  of  complex-valued  additive  random  noise, 
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$  _  [<j>i :  fc  €  A/2]  is  a  real-valued  array  of  random  phases,  F  —  [F* :  *  €  A/2)  is 
the  Fourier  transform  of  /  ,  and  A  is  a  known  complex-valued  function.  We  will 
shortly  discuss  each  term  in  (B-2)  in  detail,  but  before  doing  so,  we  introduce  the 
vector  notation  to  be  employed. 


Vector  Notation 

It  will  be  convenient  to  employ  one-dimensional  vector  notation  for  the  vari¬ 
ous  arrays  and  matrix  notation  for  the  various  linear  transformations.  For  exam¬ 
ple,  a  complex-valued  m  X  m  array  such  as  /  *»  \/p :  p  €  M 3]  will  be 
represented  as  a  real- valued  2m9-dimensional  column  vector  /,  whose  components 


are  the  real  and  imaginary  parts  of  the  components  of  /  ,  ordered  as  shown 
below: 


where  / pr 


/  "■  (/  00r  *  /  Wi  *  /  Olr »  /  01(>  •  *  •  >  t / 

“  (XoOiXoi*  •  •  •  >  Xm-l.m-t)  * 

and  ffi  denote,  respectively,  the  real  and  imaginary  parts  of  / 


(B-3) 

p  ,  and 


where  X,  —  (/,r By  way  ot  comparison  note  that  /,  ■=(/„  +•'/,<).  Note 
also  that  the  complex  number  J,  and  the  two-component  vector  X*  *  consisting 
of  its  real  and  imaginary  parts,  represent  the  same  object.  It  will  be  convenient 
to  flip  back  and  forth  between  the  two  notations,  depending  on  what  needs 
emphasis  and  what  notation  requires  fewer  symbols.  For  example,  the  magnitude 
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|  fp  |  and  the  length  1 1 X#  )  I  give  the  same  value,  but  the  former  is  simpler  to 
write  and  so  will  almost  always  be  used.  We  use  similar  notation  for  other 
arrays,  such  a a  g ,  F  and  S. 

The  Weighting  Army 

The  effect  of  the  weighting  array  w  may  be  characterized  as  a  matrix  multi* 
plication: 

L~Wl  (B-4) 

where  W  *  p,q  €  Af2;  u,t>  €  {r ,» } )  is  the  2 m2  X  2m2  diagonal  matrix 

with 

* 

»oo  0 

«Q0 

w  -  w0l 

U>01 

0 

i.e., 

«V  .  p  -  q ,  u  —  v 
0  ,  p  yL  q  or  u  v 

For  brevity  let  M2  «  A/2  X  {r,i },  so  W  —  [  IV,  t, :  t  ,t  6  M2]. 


The  Fourier  Transform 


The  Fourier  transform  relations  are 


F.  ~i  E  /,  <xp(-.'i’(»,p)),  i  6 
m  ,ew.  m 

/#»-■“  E  **  «XP{‘~  (*  »P  )>  .  P  €  A/2  , 
m  4eA/«  m 


(B-7a) 

(B*7b) 


where  (A,p)  —  (*tPi+A2Pa).  Note  that  with  the  above  definitions,  Paraeval's 
relation  takes  the  form 


E  ima~  e  \f,\2 

*ew* 


(B-8) 


The  Fourier  transform  may  also  be  characterized  as  a  matrix  multiplication; 


where  T  —  ( T,t :  »,t  €Ma], 


F-Tf-TWg  , 


Too  .00  Too  .01  Too  .03 
Toi,00  Tot, 01 


(B-9) 


(EM0) 


T*,ji  — 


T*r  ,fr 

Tkrlfi' 

’  c(p,A)  *(p,*)' 

Tkijr 

TUji 

< 

.-#(p,*)  eCp.A), 

(EMI) 


-Jl  «4  «  4  •••  a.4  1 


c(p,k) 


(B-lSa) 


m  t  m  | 

sin  I -2? (p ,* ) 
nt  j  fti 


(B-  12b) 

The  matrix  T  is  orthogonal;  i.e.,  its  rows  (columns)  are  orthonormal,  T 1  —  T“l, 
and  it  follows  that  1 1  Tf  \  \  —  1 1  /  1 1  for  any  /,  which  is  simply  Parseval's  rela- 

r+j 

lion  (B-5). 


The  Random  Phase 

The  effect  of  the  random  phase  array  ♦  can  also  be  expressed  as  a  matrix 
multiplication.  Indeed  the  complete  measurement  can  be  expressed  as 

S  ~  H{PF  )+£- H[PTWg)  +  N  (B-13) 

where 

W(a)-(Re{MA“oo},Im{A(^«)o)}(MA(^oi)},..)  (B-14) 

and  where  P  —  A,/  €  A/3;  «,t>  6  {r ,»* }]  is  the  real-valued  2m3  X  2m3 

matrix 


£  oo 

£oi  0 

« 

« 

,  o  £*-i,*-j  . 


(B-15) 
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Ek 


Pkr,kr 

Pkr  ,ki 

COS 

-sin 

Pki  ,kr 

Pki,ki 

sin 

cos  <K»| 

■ 

,  k  €  M2  (B-15) 


Note  that  P  is  orthogonal. 


The  Meuurement  Function 

Example  of  the  measurement  function  include: 


h(»)  —  M 

(B-17) 

*(*)—•  |*  |2 

(B-18) 

(B-10) 

where  “a”  is  a  real-valued  constant.  Note  that  with  (B-18)  and  (B-1Q),  the 
resulting  measurements  Sk  —  h{Fk  exp  {» })  +  iV*  are  not  affected  by  the  <f»4's. 
Moreover,  they  are  real-valued,  so  that  the  imaginary  parts  of  Sk  and  Nk  may  be 
ignored. 


Statistical  Assumptions 

We  make  the  following  statistical  assumptions. 

(1)  The  Additive  Noise:  The  2m 2  components  of  N  are  independent  and 
identically  distributed  (IID).  Consequently,  the  m2  complex  components  of  N  are 
also  UD.  When  it  comes  to  making  specific  calculations,  we  will  also  assume  that 
each  component  of  A  is  Gaussian  with  mean  0  and  variance  <r$/  2;  i.e.,  they  are 
2).  This  implies  E  [  |  Nk  | 2]  =  eft- 


(2)  The  Phase  Noise:  The  m2  components  of  4>  are  IID  with  density  *>+(♦). 
Specific  calculations  will  later  be  made  for  the  uniform  density  on  (0,2jr).  This 
models  a  complete  loss  of  phase  in  the  measurements  5 . 

(3)  The  Image:  The  2m2  components  of  g  are  IID  Gaussian,  0,<r 2/  2). 
Consequently,  the  m2  complex  components  of  g  are  IID  with  E  \  \  gk  | 2]  =>  <rf2. 

(4)  The  additive  noise  N,  the  phase  notice  4>  and  the  image  g  are  indepen' 
dent  of  each  other. 

These  assumptions  have  two  important  consequences.  First,  the  facts  that 
Sk  —  h(Fi,  «'**)  +  Nk  and  that  N  and  ♦  are  IID  and  independent  of  each  other 
and  of  g  imply  that  each  measurement  5*  (equiv.  £*)  depends  only  on  the 

corresponding  element  Fk  (equiv.  &)  of  the  transformed  image;  i.o.,  given  Fk, 
the  measurement  Sk  is  conditionally  independent  of  all  other  ’s  and  5,  ’s. 
That  is, 

P(jgl£)-  n  PG2*  I £*)  .  (B-20) 

Thus,  in  regard  to  estimating  F,  we  have  the  situation  of  one*for-one  indepen¬ 
dent  complex  measurements,  as  discussed  in  the  section  on  the  Cramer-Rao 
bound  for  complex  parameters  and  measurements. 

For  future  reference,  the  individual  measurement  statistics  p  (&  |  & )  may  be 
derived  as  follows 
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(B-21) 


p{&  l£i)  —  /  p(&  I**  J5i)p  (♦*)<*♦*  , 

o 

where 

P  (&  I  ♦*,£.)-  P*  (A  ■ -*  (ft  •'**))  (B-22) 

The  second  consequence  of  these  assumptions  is  that  /  and  F  are  Gaussian, 

because  they  are  linear  combinations  of  Gaussian  variables.  Their  means  are  zero 
and  their  covariance  matrices  are 

-2 

C ,  -£[//']-  WC,  W'  -  W8  (B-23) 

-2 

C>  -  TCf  T*  -  TVK8r‘  .  (B-24) 

Since  C#  is  diagonal,  the  components  of  /  are  independent  with  variances 

W„.  -  ^  ■  CB-as) 

They  are  not,  however,  identically  distributed  unless  all  weights  wp  are  identical. 
We  will  next  show  that  each  component  of  F  is  N[0 ,aj /  2),  where 

<r?  -  *,2~2  £  «Va  .  (B-20) 

m  j»eAf8 

and  that  if  and  F* i(-  are  independent,  for  any  A: .  It  is  not  true  in  general,  that 
Fku  and  Fh  are  independent  for  k  /l . 


Using  the  independence  of  the  components  of  /,  we  find  that  for  any 


0in.n,i“»£ 


£  V,,/,.  E  *Vfi  /,. 


>» 


f* 


Now  if  A  — /  and  u  —1/ ,  we  have 


sMl  -  £  3  <n2,,. 

J)W  * 


-  &  • 


(B-B7) 


(B-28) 


and  this  establishes  (B*26).  Finally,  if  A  u  —  r  and  e«»i,  then 


mthi\  -  E  i  <  (nr,, r  **,>  +  T*r,„  7^  ) 

r  * 

-  ~  £  V  — a  (“c(p,A)s(p,A)  +  «(p,A)c(p,A)) 


-0  , 


(B-20) 


and  this  implies  that  F*r  and  Fj,  are  independent. 


Bounds  to  Estimation  Error 


The  problem  to  to  estimate  g  —  |j( ;  *  6  M3)  from  5  —  [St  ;f  eM3).  We  seek 
to  lower  bound  the  minimum  possible  mean  squared  errors.  Let  g  «*•  [g, :  i  6  M2) 


denote  an  estimate  of  jj,  let  'g  ( g  -  g)  denote  the  resulting  vector  of  errors,  let 


C*  Efg'g  |  denote  the  covariance  matrix  of  let  «  —  [«, a  6  M3]  denote 
the  vector  of  mean  squared  errors  (note:  «# ,  E  |jT,3)  and  «  1s  the  diagonal  of 


cy),  and  let 


»,  -  £  «M  -  y)  -  £[|  |£  1 12]  -  E(g  £|  (B-30) 

»eM*  L  ~  J  " . . 


denote  the  total  mean  squared  error,  Actually,  one  to  only  interested  in  the 
errors  in  components  of  g  that  are  given  nonzero  weight.  Correspondingly,  let 


(jm)sM®: 


(B-31) 


From  now  on  we  assume  g  is  the  optimum  estimate  of  g)  namely  the  ono  for 


which  each  component  of  e  to  smallest,  ( g  »  E  [  g  \  S  ] ).  We  seek  lower  bounds 
to  e  ,  e. ,  and  ef+.  Our  approach  to  to  first  consider  the  problems  of  estimating 
/  and  F  from  1 .  We  will  show  that  the  optimum  estimates  for  /  and  F  and 


the  resulting  mean  squared  errors  arc  closely  related  to  those  for  g.  The 


Cramer-Rao  lower  bound  is  most  easily  applied  to  the  estimates  of  F ,  and  such 
bounds  may  then  be  converted  to  provide  bounds  to  the  mean  squared  errors  in 


the  estimates  of  /  and  g. 

rsj 

The  fact  that  /  =  Wg  suggests  that  /  =  Wg  would  be  a  good  estimate  for 
/.  Indeed,  it  must  be  the  optimum  estimate  for  /,  because  if  there  were  a  better 
estimator  f  for  /,  then  the  estimator 


fi 

t 

~  pu 


-i-f 


U>.  pu 


0  ,  wp  —  0 


(B-32) 


would  be  better  for  g  than  g ,  which  would  contradict  the  latter’s  optimality.  We 

rsJ  r+J 

A 

conclude  from  this  argument  that  the  optimal  estimate  /  for  /  is  related  to  that 
of  g  via 


/  -  Wg 

'•oj 

(B-33) 

/-IVf 

(B-34) 

c  = 

~/  ~f 

(B-35) 

C  f  »  wc  rw* 

(B-30) 

-  E  • 

0*.«)eM8 

(B-37) 

By  the  same  argument  the  optimal  estimate  /  for  /  is  related  to  the  optimal  esti- 

rw  aw 

A 

mate  F  for  F  via 


F  =  Tf 

(B-38) 

AW  AW 

F-Tf 

(B-39) 

AW  AW 

/  =  TC  ~T 1 

(B-40) 

eF  =  ej  , 

(B-41) 

where  tho  orthogonality  of  T  was  used  to  obtain  (B-41). 

We  will  now  find  the  Cramer-Rao  lower  bound  to  the  mean  squared  error  in 
F,  and  then  use  (B-33)-(B-41)  to  find  lower  bounds  to  the  mean  squared  errors  in 

a  m  a  - 

/  and  g .  The  Cramer-Rao  lower  bound  applied  to  F  and  F  yields 


*F,t  t/p  +  Kp\al  i  1  €  M8  ,  (B-42) 

where  1F  and  KF  are  the  2m2  X  2m2  matrices  defined  by 

JF  ***  E  £•(•//>■  ]  (B-43) 

Jr.,,  -  B  s  /  -±  !n  p(S  I  F)  to  p(S  |  F)  j  (B-44) 

~ez  I  W.]n,{~)Wl  '"'©I  '  I®-46* 


where  EF  and  Es  denote  statistical  averages  over  F  and  5,  respectively.  The 
Cramer-Rao  bound  also  gives  the  more  complete  result 


X. 

V 

i 

a 


Cr>[J„  +KY'  , 


(B-46) 


where  A  >  B  means  that  A  -  B  is  a  non-negative  definite  matrix.  Note  that 
since  A  >  B  implies  Aik  >  Bik ,  equation  (B-46)  implies  (B-42). 

Now  from  (B-40)  and  (B-46)  we  obtain  a  lower  bound  to  the  covariance 

A 

matrix  of  the  errors  in  /: 


r  C  ?  >T 


>r‘(7,  +  K,\~lT 
~{T'(Jp+Kf)T\"' 

=»  (T*4  JpT  +  T* Kp  r]'1  , 


(13-47) 


where  we  have  used  the  fact  that  T*  AT  is  non-negative  definite  if  A  is,  and  con¬ 
sequently,  A>B  implies  T*  AT>T*  BT , 

Finally,  we  obtain  a  lower  bound  to  the  mean  squared  errors  in  £*  from  (B- 
35)  and  (B-47): 


•/,.  ^  i T'J,T  +  T'Kf  T]"1 


»'*  -TTT  — 


,  if  4  *=  (p,u)  and  wp  >0  .  (B-48) 


Clearly,  etil  =0^/2  if  *  —(ptu)  and  wf  =  0.  It  remains  for  us  to  compute  JF 
and  KF . 


The  matrix  KF : 

Using  the  Gaussian  nature  of  F ,  direct  calculation  (see  also  (A-4.12)  in  the 
discussion  of  the  Cramer-Rao  bound)  gives 

Kf  -  Cfx  .  (B-40) 

Thus  the  term  involving  KF  in  (B-47)  is 

T'KfT  -  T'Cf'T  -  Cjx  a  ,  (B-50) 

u 

where  we  have  used  (B-23)  and  (B-24).  Note  in  particular  that  X*  A>  T  is  a  diag¬ 
onal  matrix. 


The  matrix  JF  \ 

As  mentioned  earlier,  the  measurements  in  S  —  \8k  —  *  eM'*\  are  one-for- 
one  and  independent  in  regard  to  estimating  F  [Fk  \  k  6A/8].  As  discussed  in 
Section  A.5,  this  implies  the  matrix  JF  has  the  form 


JF 


J (00)  0 

J(  01) 

0 


(B-Sl) 


where  J^^keM2,  is  the  2  X  2  matrix  in  the  Cramer-Rao  bound  for  estimating  £* 
based  on  That  is  for  any  u  ,v  €{r  ,i } 
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[*fy)U  —  E 


fir  l>rUi  l£i )~  In  p(&  la) 


(B-62) 


Since  the  joint  distribution  of  Fa,S*  does  not  depend  on  k,  the  matrices 
7(A)  —  [/(*)!  are  the  same  for  all  k,  and  we  denote  such  by  J(,y  Then 


jF  » 


7(0  ^  0 

J(«) 

0 


(B-53) 


It  remains  only  to  find  the  2  X  2  matrix  7(«).  This  will  be  done  for  the  three 
choices  of  h  given  in  (B-17)-(B-10).  We  shall  see  that  for  each  of  these,  J^t)  has 
the  form 


7(.)- 


4  0 
0  d 


(B-54) 


where  d  depends  on  h  and  the  statistics  of  N  ,4>  and  F.  Consequently,  JF  is 
itself  a  diagonal  matrix: 


I 

4  0 

4 

4 

i 

o 

*  • 

(B-55) 


Assuming  that  J?  is  diagonal  we  now  substitute  (B-'iO)  and  (B-55)  into  (B- 
47)  and  find 


-  -  aaaes, «  Letts  i  i  xjj. x -  «•  imamr 


a  iaca&aaasa  a  a  aaxajuiaaatnattiBi  as  McaBantana 
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o  >  [r‘7,  t  + 

“  ['  +  7!  H" 


«,  >  £ 


2c. 


and 


+  a  — ,  ^  «-»  2cj 

c#  “  .  J  <P*  -  _  2 

(f.-jeM*  >e«»  «V 

*,  >0  *,  >0 


,&•  +  2/  *» 
■|>0 


In  the  special  case  where  n  pixels  are  weighted 
els  are  weighted  by  wt  —  0, 


(B-62) 

by  w,  —  1  and  the  remaining  pix- 


t.  - 


2n 


i  +  2/ 


The  Three  Special  Cases 


Let  us  now  assume: 


(i)  The  components  of  the  additive  noise  N  are  Gaussian  N( 0,*#/  2). 

(ii)  The  random  phase  4*  is  uniformly  distributed  on  [0,2n|. 

For  each  of  the  three  choices  of  the  measurement  function  A  given  in  (B-17)-(B- 
10),  we  will  show  that  the  matrix  7(,j  has  the  form  given  in  (B-54)  and  we  will 
find  an  expression  for  the  paramater  d . 


Case  1]  h{x )  =  as 

Recall  that  7(«)  is  the  matrix  associated  with  the  Cramer>Rao  lower  bound 
for  estimating  £j  »■  (F*r  )  from  Si  —  (5*,  ,Ski )  where 
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t 


S4  -  ,  (B-64) 

where  {Fkr  ,Fki )  are  independent  N(0 ,<r$/  2),  v}  —  «>2i  where  is  uni¬ 

formly  distributed  on  [0,2 jr],  where  (NirtNu)  are  independent  N(0,<rjt/2)t  and 
where  and  are  mutually  independent.  To  simplify  notation,  from  now 

on  we  omit  all  subscripts  A;  e.g.  we  write  £  —  (St  ,S< )  instead  of  £*  —  (54r,54l). 
Also  recall  that  for  u  ,v 6{r  ,f } 


jfr  la  PCS  I £)-jfr  l»  Ptf  l£) 


(B-65) 


Using  the  specific  form  of  A  and  the  densities  of  Af  and  4  in  (B-21),  (B-22)  and 
integrating  gives  the  measurement  statistics 


u 


\S-»Ft'*\a 

p(il£)-/-^«"  '* 


0  "Wat 


S  l»+  »a\F  l» 


|  *.  1  ffJJ/U  e 
***  *&/  2 


(B-66) 


where  /„(  •)  denotes  the  zeroth-order  modified  Bessel  function  of  the  first  kind; 
i.e. 

/.(«)-  T,  /•* "’i<x  •  (B‘67' 

For  future  reference  we  note  that  since  p(£  |£)  depends  on  £  only  through 
|  F  | ,  we  have 


(B-68) 


p(fi|  |P|)-p(£l£)  • 

Also  for  future  reference  we  now  show  that  the  conditional  density  of  the  magni¬ 
tude  |  S  |  given  £  (  or  |  F  | )  is  Rician: 

P|S|(*  I  £)--j  All*  IS*  l£) 


which  has  the  form  of  a  Rician  density.  Since  p(  |  S  |  |  £)  depends  on  £  only 
through  |  F  | ,  we  also  have 


To  compute  /(,),  we  find  using  (B-68) 


I  1 

t 


vl 

II 

i 

■nl 


$ 


$ 


In  p  (5  I  £ )  “ 


,  <*is  | . \a\  | 

l  °s!  2  ) 


«  I g  I  9|/M 

<Tj^r/  2  OF, 


r  \*\S\  Ifl)  9Ut  2  ^ 

*  l  »j&/2  I 


_o_s  g  |F  |a 
<rft  dFu 


(3-72) 


where  /'  ,(t)  denotes  the  derivative  of  /#(s ),  where 


0,<,) 


(B-73) 


and  where  we  used  the  facts  that 


an  __£•  ,±m?  _2P, , 

af#  IT]  ’  »rt  • 


(B-74) 


Then  substituting  (B-72)  into  (B-65)  yields 


I'ciU  -  $  f.f,  b, 

9N 


[(°‘ 


9n!  2 


(B-75) 


We  next  show  that  the  term  inside  the  inner  parentheses  has  expected  value  zero. 
Specifically  using  (B-60) 


Es 


IW  ]-!«■  PHfcF  IW  '"'I' i  '''»<•< 1 


00 


; 


.  I&*  ib  +  «ai p  ia 


o  /  \±imn 

l  °n!  ^ 


111  ILL  /  1.LLU/L1  |  e"  •*  d\s\ 
\n  *fr/2e[  <?n!  2  I  11 


i'ijm* 


.-liL1 


_f_e-  - 


*h\  r\ 


.  •an  ,  _ .  .  inv 

•  2  “  * 


—  a 


(B-76) 


where  we  used  the  integral  formula 


with 


00 


//'  .  (d*)aV«*s 


rfj 


,  i.1 

A  \B 
4B3 


A 


ULLL*  lB.i 

•A  ^ 


Using  (B-70)  in  (B-75)  gives 

[■'(.)]„  -^7  r.r.Mirih'1) 

where 


(B-77) 


(B-78) 


(B-70) 
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Hft ) 

Thus,  J(t)  has  the  form 


a  i 


(B-80) 


M  ~^m\F\  )-«s) 

ffN 


F2  FrF( 
FfFi  F2 


(B-81) 


The  next  step  is  to  find  7(,j,  the  expectation  of  /<,)  over  |  F  \ .  To  this  end, 
observe  that 


Bc  [(«( I F  |  y^KFi  ]  -  Et  [ff(  |  F  I  )F,Ft  ]  -  [f,f,  ]  .  (B-82) 

The  second  term  is  zero  because  F,  and  Fi  are  uncorreiated  with  zero  means. 
The  first  term  is  also  zero  because  Ft  and  F(  are  also  conditionally  uncorrelated 
with  conditionally  zero  means  when  given  any  value  of  |  F  \ ,  Thus 

IH*)*V*i]  “0  (B-83) 

Next,  observe  that 


Ec  [(//(IFD-’K’]  -bc  [«h(|f  D-«s)f,s] 

Finally,  using  (B-81),  (B-83)  and  (B-84),  we  obtain 


where 


0 

d 


I 


(B-84) 


(B-85) 
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We  have  not  been  able  to  find  a  closed  form  expression  for  6 .  However,  using 
the  fact  F,  —  |  F  |  cos  9,  where  |  F  |  and  9  are  independent  random  variables 
with  9  uniformly  distributed  on  (0,2ir)  and  with  |  F  |  Rayleigh  distributed  with 
density 

-ILL* 

p(  I  ^  I )  -  *  (B-88) 

Op  l  2 

and  using  (B-87)  and  (B-88)  and  simplifying  a  bit  gives 


oooozsr 

o  o  0  t 


|  cos  9fp(9)p{  |  F  I  )p(  |  S  |  I  |F  \)Ud  |f  \d  \S 


’N  77(''.dS|  IM))’  |f,a|r|  ' 

»|7'  0  o  /.(Is  I  |f|)  |S|  |F|  * 


I"ii7>  +  ^|s|,7r- 


^  I  f  M  I  f  I 
(B-80) 


Notice  that  6  depends  only  on  7N/a  and 
Case  2:  A(* )  —  a  |  a  | 2 
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This  case  was  previously  analyzed  in  Section  2.  Here 


Si  —•  1*11*+ AT,  ,  (B-00) 

where  (F^,,F^)  are  independent  n(0,ap/  2),  op  —  u>at  where  /V*  is  real¬ 

valued  iKO^jv)  and  where  &  and  N *  are  mutually  independent. 

Proceeding  as  in  Case  1,  we  drop  the  subscript  k  and  find  the  measurement 
statistics 


P(S  \E)-Pn(S-*  \F  |2) 

(s-i  I*1!*)8 

^  1  ,  'ft 


(B-Ol) 


Then 


In  ,(S  I £)  -  In  (3-02) 

—■  In  p (S  |£)-  -i-Jf,(S-.  |F  |J) 

or*  9fi 

-  AT  (B-93) 
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I nj»(5  |£)~r  \*P(8\E) 


oft 


8a_J 
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F»F, 


(B-04) 
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mamm; 


(B-05) 


F'Fi 

Fr 3 


Now,  averaging  over  £  gives 


where 


4  a  2Of 

m2»fi 


s  V 


(B-06) 


(B*07) 


(B*08) 


Notice  that  </  is  a  kind  of  signal  to  noise  ratio.  Substitution  into  (B-60)  gives  a 
result  in  agreement  with  that  in  Section  2. 
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Cue  3:  h(z  )  =  a  \x  | 


Here 


Sk^a\Fk\+Ni  ,  (B-99) 

where  Fkf  ,Fki  are  independent  2),  trj  -* /r#2m"4£,  w,2,  where  /V4  is  real¬ 

valued  iHO.ffA-)  and  where  &  and  Nk  are  mutually  independent. 

Proceeding  as  in  Case  2,  we  drop  the  subscript  k  and  find  the  measurement 
statistics 


P(S  | £)-p«,($-a  IF*  |) 
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(B-100) 


Then 


lnp(5  |£)«ln-ri«. 


1  (g-al^l)2 
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(B-102) 
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tap(5  \E)^r  bp  (8  I  £) 


d 
dFu 

s  F,  F, 
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|F|! 


Averaging  over  £  gives 


where  we  have  used  the  fact  that  E[F2/  |  F  | 8]  ■  1/2  .  Hence 


where 


i 


(B-103) 


(B-104) 


(B-105) 


1 


(B-106) 


Notice  that  here  i  does  not  depend  at  all  on  <r$. 
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Phase  Retrieval  for  Discrete  Functions  with  Support  Constraints:  Summary 


Thomas  R.  Crlmmlns 

Environmental  Research  Institute  of  Michigan 
P.  0.  Box  8618,  Ann  Arbor,  Michigan  48107-8618 


1.  Introduction 


The  phase  retrieval  problem,  l.e.,  the  problem  of  reconstructing  a  func¬ 
tion  from  Its  Fourier  modulus  or,  equivalently,  from  Its  autocorrelation  func¬ 
tion,  arises  In  many  fields,  e.g.,  astronomy,  wave-front  sensing.  X-ray 
crystallography,  electron  microscopy,  particle  scattering  and  pupil -function 
determination.  Here  we  consider  the  case  In  which  the  object  function  Is 
assumed  to  be  defined  on  a  two-dimensional  discrete  grid  of  sample  points. 

Bruck  and  Sodln  [1]  showed  that  uniqueness  of  phase  retrieval  In  this 
case  Is  equivalent  to  the  Irreduclblllty  of  a  polynomial  In  two  variables 
which  Is  closely  associated  with  the  Fourier  transform  (z-transform)  of  the 
object  function.  Flddy,  Brames,  and  Dainty  [2]  used  Elsensteln's  Irreducl- 
blllty  criterion  to  prove  uniqueness  for  discrete  object  functions  satisfying 
certain  support  constraints.  Flenup  [3]  showed,  among  other  things,  that 
object  functions  satisfying  certain  different  support  constraints  ("triangular 
objects")  are  uniquely  defined  by  their  autocorrelation  functions  among  all , 
other  object  functions  satisfying  the  same  support  constraints.  He  also 
presented  a  closed-form  algorithm  for  reconstructing  such  object  functions 
from  their  autocorrelation  functions. 

A  generalization  of  Flenup1 s  results  to  a  wider  class  of  support  con¬ 
straints  Is  presented  here.  Also,  an  algorithm  for  generating  closed-form 
reconstruction  algorithms  Is  described.  B.  J.  Brames  recently  obtained  a 
result  (unpublished)  similar  to  the  uniqueness  theorem  In  Section  3. 

Proofs  will  be  omitted  In  this  summary. 

2.  Masks 

Let  R2  denote  the  Euclidean  plane  and  let  Z2  denote  the  points  In  R2  with 

2 

Integer  coordinates.  A  finite  subset  of  V  Is  a  mask  If  It  contains  at  least 
three  non-colllnear  points  and  Its  convex  hull  In  R^  has  no  parallel  sides. 

Let  M  be  a  mask  and  let  [M]  denote  its  convex  hull  In  R2.  Then  [M]  Is  a  con¬ 
vex  polygon  (Including  Its  Interior).  A  vertex  v  of  [M]  Is  opposite  a  side  s 
of  [M]  If  the  line  through  v  and  parallel  to  s  contains  no  points  of  [M]  other 
than  v.  A  vertex  of  CM]  Isa  reference  point  of  M  If  It  Is  opposite  some  side 
of  CM] .  The  set  of  all  reference  points  of  M  will  be  denoted  by  R(M). 

3.  Uniqueness  Theorem 

o 

Let  f  and  f^  be  complex-valued  functions  on  Z  .  The  support  of  a  func¬ 
tion  on  Z2  Is  the  set  of  points  at  which  the  function  is  non-zero.  Let  S(f) 
and  S(fj)  denote  the  supports  of  f  and  fj  and  let  r  and  r^  be  the  autocorre¬ 
lation  functions  of  f  and  fj,  respectively.  We  have  the  following  unique¬ 
ness  theorem. 


J 

i 

i 

I 

I 

3 

j 

Theorem;  If  M  Is  a  mask,  R(N)  £  S(f)  £  M,  S(fj)  £  M  and  r  -  rv  then  there  j 

exists  a  complex  number  a  of  modulus  1  such  that  fj  ■  of. 

j 

4.  Reconstruction  Algorithms 

In  this  section  closed-form  algorithms  for  reconstructing  &  function  from 
its  autocorrelation  function  will  be  described, 


Let  S  be  the  number  of  vertices  of  [M].  Let  vQ,  vs-1  be  an  ordering 
cf  the  vertices  going  abound  [M]  in  the  counter-clockwise  direction  and  let 
Pgt  ....  pf.j.be  a  similar  ordering  of  the  reference  points  of  M.  It  can  bo 
shown  that  R(M)  contains  an  odd  number  of  points  so  that  T  Is  odd,  Let  K  ■ 

( T-l )/2  and  let  qn  ■  P(nK)moe|  f  for  n  •  0,  . ...  T  -  1.  Since  K  and  T  are 

relatively  prime,  the  qn  are  distinct  and  hence  run  through  all  the  points  of 
R(M).  It  can  be  shown  that  qn  and  R(n+i)mocj  j  have  unique  separation  In  M. 
Th«t  U.  if  x.  y  t  H  and  *  -  y  ■  q(nn)mo(!  T  -  dn  than  x  ■  q(n+l)lTOd  T  and 

*  -  v 

Let  N  be  tho  number  of  points  In  M.  .  A  reconstruction  algorithm  for  the 

mask  M  Is  an  ordered  pair,  (g.  m),  where  q  *  (qQ,  . ...  qN-1)  Is  an  ordering  of 

the  points  In  M  and  m  ■  (mT,  ....  mN-1)  Is  a  sequence  of  Integers  satisfying 

the  following  conditions.  The  points  g0,  . ...  qT-1  are  as  described  above. 

For  n  ■  T,  ....  N  -  1,  the  Integers  m^  satisfy  the  condition  0  <  mn  <  T  -  1, 

and  M  n  (M  +  qn  -  qn  )  £  |qQ.  ... ,  qnj  and  M  n  (M  -  qn  +  qm  )  £  |qp*  .... 

_  l  n  n 

qn-l’‘ 

In  the  next  section  an  algorithm  for  generating  such  reconstruction  algorithms 
will  be  described. 

In  order  to  justify  the  above  definition  of  reconstruction  algorithms  It 
will  now  be  shown  how  they  can  be  used  to  reconstruct  a  function  from  Its 
autocorrelation  function. 


Let  f  be  a  complex-valued  function  on  Z^  satisfying  R(M)  £  S(f)  £  M  and 
let  r  be  Its  autocorrelation  function.  Since  qn  and  )mod  T  have  un1que 
separation  In  M,  It  follows  that  r(q(n+1)(nod  T  -  q„)  »  f<P(n+i)m0d  T>  *  f(qn}* 
where  the  *  Indicates  complex  conjugation.  Therefore, 


K 

nU0  r(q2n  "  q(2n+l )mod 

- ^ - 

n50  r^2n+2  “  q2ri+l> 


(1) 


Since  f  Is  defined  by  r  only  up  to  multiplication  by  a  modulus  1  complex 
scalar,  we  may  require  that  f(qQ)  >  0.  Then  f (qQ)  Is  equal  to  the  positive 
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square  root  of  the  right-hand  side  of  Eq.  I.  Now  f(qn)  can  be  computed  for 
n  ■  1,  ....  T  -  1  from  the  formula  f(qR)  ■  r(qn  -  **  can  be 

shown  that  If  (q,  m)  Is  a  reconstruction  algorithm  then  the  following  program 
will  compute  f (qn )  for  n  ■  T,  ...»  N  -  1. 

Set  f(qj  -  0  for  n  ■  T,  ... .  N  -  1  and  set  n  -  T  -  1. 

n 

Step  li  If  n  ■  N  -  1,  stop.  Otherwise  n  *  n  +  1. 

■  n-1 

Step  2:  f(q„>  «  r(qn  -  q^)  -  E  f(qk>  f(dk  -  q„  ♦ 

Step  3:  Go  to  Step  1. 


5.  Algorithm  for  Generating  Reconstruction  Algorithms 

It  will  be  assumed  that  we  are  given  a  sequence  of  all  vertices  vQ,  ...» 
vS-i  °*  CM]  where  M  Is  a  mask  and  the  sequence  Is  ordered  In  the  counter¬ 
clockwise  direction  around  [M], 

For  n  -  0.  ....  S  -  1,  let  sn  be  the  side  of  [M]  with  endpoints  vn  and 

v(n+l)mod  S'  Let  b  b8  th8  °P*»*ator  on  rZ  wh1ch  rotate*  each 

vector  In  R2  90°  counter-clockwise. 

First,  the  reference  points  p0,  ....  pT-l  must  be  found.  Note  that  every 
side  of  [M]  has  a  vertex  opposite  It  which  Is  therefore  a  reference  point.  Of 
course,  several  sides  may  have  the  same  vertex  opposite  them.  Let  wn  ■ 

v (n+1  )mod  s  *  vn  for  n  "  °*  S  “  l*  It;  can  be  shown  that  a  vertex  vm  1s 
opposite  a  side  sn  if  and  only  If  <vm,  Uwn>  >  <vk,  Uwn>  for  k  ■  0 . 

S  -  1,  where  <,>  denotes  the  Inner  product  on  R2.  Thus,  by  taking  each  side 

In.  the  order  . sS-1  all  the  reference*  points  of  M  will  be  found  and  If 

they  are  numbered  In  the  order  In  which  they  are  found,  Pq,  ...,  P-j-.j*  then 

the  ordering  will  be  In  the  counter-clockwise  direction  around  [M], 

As  mentioned  above  T  Is  odd.  Let  K  ■  (T  -  l)/2  and  qn  ■  P(nK)mo(j  j»  n  * 
0,  T  -  1.  Since  each  reference  point,  qn.  Is  a  vertex  of  [M],  there 


exists  an  Integer  kn  ^uch  that  0  <  kfl  <  S  ■  1  and  qf 


L  *  Vi,  .  For  n  ■  0,  .... 
"  *n 

Then  It  can  be  shown  that 


T  -  1,  define  y  ■  Uw(.  ,,  d  $  -  Uwk  .  Then  It  can  be  shown  that 

'*n  (n+l)mod  T 

for  x  6  M.  x  t  q„  and  *  f  q(ntl)mod  T.  <V  *„>  <  <*.  T'  yn>' 

(The  uniqueness  of  separation  of  qn  and  d(n+i)mocj  y  mentioned  In  Section  4 
follows  from  this  double  inequality.) 


Now  let  otn  -  qn  q(n+1)fnod  T  and  let  Bn  »  an/2  for  n  -  0,  ....  T  ■  1. 

Let  D  ■  H-'R(M)  (set  difference)  and  let  ♦  be  the  characteristic  function  of  D 
as  a  subset  of  Z2.  That  Is,  0  1*  the  function  on  Z2  which  Is  1  on  D  and  0 
outside  0.  The  set  D  contains  N  -  T  points.  For  x  e  D  and  0  <  n  <  T  -  1,  let 
h  (x)  ■  <x  -  Bn,  yn>.  Now  define  T  orderings  of  the  points  In  D,  D  -  )dn>(JI 

....  n  -  0,  ....  T  -  1,  satisfying  |hn(dn|k)|  > . 

k  ■  0,  . ,  N  -  T  -  2. 


"n(dn,kU>l  for 


Set  n  ■  T  -  1  and  k  ■  0  and  enter  the  following  loop. 

Step  1.  If  n  ■  N  -  1,  stop.  Otherwise  define 

t  -  min {j  i  0  <  j  <  N  -  T  -  1  and  <fr(dkjj)  *  l|. 

Step  2.  If  0(«k  -  dk  t)  ■  1,  go  to  Step  7. 

Step  3.  n  «■  n  +  1. 

Step  4.  Define  qn  ■  dk  t. 

Step  5.  If  hk(qn)  >  0, ’define  mn  ■  k.  Otherwise  define  mn  “  (k  +  l)mod  T. 
Step  6.  4>(qn)  ■**  0. 

Step  7.  k  *■  (k  +  l)mod  T  and  go  to  Step  1. 

It  can  be  shown  that  the  loop  Is  not  Infinite  and  If  q  *  (qQ . qN-1) 

and  m  -  (mT,  ....  mN-1)  then  (q,  m)  Is  a  reconstruction  algorithm. 

6.  Implementation 

The  algorithms  presented  In  the  last  two  sections  can  be  Implemented  with 
two  computer  programs.  The  first  program  would  Implement  the  algorithm  In 
Section  5.  Its  Input  would  be  a  mask  and  Its  output  would  be  a  reconstruction 
algorithm.  The  second  program  would  implement  the  program  In  Section  4.  Its 
Input  would  consist  of  a  reconstruction  algorithm  and  an  autocorrelation  func¬ 
tion  and  Its  output  would  be  the  object  function.  With  this  arrangement,  If 
one  wished  to  reconstruct  many  object  functions  using  the  same  mask,  the  first 
program  would  have  to  be  run  only  once. 
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It  to  shown  that  phaae  retrieval  for  two-dimensional  discrete  object  functions  eetiefyittg  certain  support  constraints 
is  unique  among  all  objoct  functions  satiafying  the  same  support  constraint.  Cloeea-form  reconstruction  algo¬ 
rithms  for  rsconstruoting  such  objsct  functions  from  thair  autocorrelation  functions  art  defined.  Alto,  an  algo¬ 
rithm  for  generating  that#  reconstruction  algorithms  Is  described. 


1.  INTRODUCTION 

The  reconstruction  of  object  functions  having  nonredun- 
dant  spacingt  was  discussed  previously,1  Hayts  and  Qua- 
tlerl3  showed  that  tha  boundaries  of  triangular  objects  can  be 
reconstructed  by  mtking  use  of  certain  spacing!  In  tha  object 
that  ars  nonredundant,  In  another  direction,  Bruclc  and 
Sodln*  showed  that  the  uniqueness  of  phase  retrieval  la 
equivalent  to  tha  Irreduciblllty  of  a  polynomial  in  two  vari¬ 
ables  that  is  closely  related  to  the  Fourier  transform  (i  trans¬ 
form)  of  ths  object  function.  Fiddy  et  al.*  used  Elieenstain’i 
Irreduciblllty  criterion  to  prove  uniquanesa  for  object  func¬ 
tions  satiafying  certain  support  constraints  and  showed  that 
Flenup'e  input-output  Iterative  Fouriar-tranaform  algo¬ 
rithm4"7  converged  faster  to  a  hotter  reconstruction  when 
these  constraints  were  sotiafled.  Flenup*  presented  e 
closed-form  algorithm  for  reconstructing  such  object  func¬ 
tions  from  thoir  autocorrelation  functions.  He  also  present¬ 
ed  a  similar  closed-form  reconstruction  algorithm  for  objects 
satisfying  a  triangular  support  constraint  and  thereby 
showed  that  auch  objects  ore  uniquely  defined  by  their  auto¬ 
correlation  functiona  among  all  object  functions  satisfying 
the  same  support  constraint. 

A  generalization  of  Fianup's  results  to  a  wider  class  of 
support  constraints  is  presjsntsd  here.  Also,  an  algorithm 
for  generating  closed-form  reconstruction  algorithms  is  de¬ 
scribed.  Bramea3  recently  obtained  a  result  similar  to  the 
uniqueness  theorem  described  in  Section  3. 

2.  MASKS 

Let  ft3  denote  the  Euclidean  plane,  and  let  Z  denote  the 
points  in  ft3  with  integer  coordinates.  A  finite  subset  of  Zl 
Is  a  mask  If  it  containa  at  least  three  noncollineer  points  and 
its  convex  hull  in  ft3  (the  smallest  convex  set  conteining  it) 
has  no  parallel  sides.  These  conditions  on  a  mask  ars  need¬ 
ed  iitthe  proof  of  the  uniqueness  theorem  in  the  noxt  section 
(proof  in  Appendix  A)  and  In  the  proof  of  the  algorithm 
presented  in  Section  S  (proof  in  Appendix  C),  Let  M  be  a 
mask  and  let  [M]  denote  its  convex  hull  in  ft3,  (Af]  Is  then  a 
convex  polygon  ( including  its  interior;  see  Fig.  1 ).  A  vertex  u 
of  (Af|  is  opposite  a  aide  a  of  [Af]  if  the  line  through  u  and 
parallel  to  a  containa  no  points  of  [Af|  other  than  u  (see  Fig. 


2).  A  vertex  of  [Af|  is  a  reference  point  of  Af  if  It  Is  opposite 
some  side  of  [Af]  (ice  Fig.  3).  The  let  of  all  reference  points 
of  Af  will  be  denoted  by  ft(Af), 

9.  UNIQUENESS  THEOREM 

Let  f  be  a  complex-valued  function  on  Z3,  The  support  of  a 
function  on  Z1  Is  the  set  of  points  at  which  the  function  Is 
nontsro.  Let  Jf (/)  denote  the  support  of  f.  If  J?(f)  Is  a  finite 
set,  the  autocorrelation  function  of  f  ia  defined  for*  €  Z3  by 

r(x)  m  V  /(y)/(y-*)*,  (1) 

where  the  *  denotes  complex  conjugation.  Let  f\  be  another 
complex- valued  function  on  Z  with  finite  support  J'(/i)  and 
autocorrelation  function  r;,  We  have  the  following  unique¬ 
ness  theorem: 

Theorem:  If  Af  is  a  mask,  fl(Af)  s  #(f)  c  Af,4’(/i)  s  Af.andr 
■  n,  then  there  exists  a  complex  number or  of  modulus  1  such 
that  f\  ■  af, 

The  proof  is  in  Appendix  A. 

4.  RECONSTRUCTION  ALGORITHMS 

In  this  section  cloeed-form  algorithms  for  reconstructing  a 
function  from  its  autocorrelation  function  will  be  described, 

Let  S  be  the  number  of  vertices  of  [Af] .  Let  d„ . cs_  i  be 

an  ordering  of  the  vertices  going  around  [Af]  in  the  counter¬ 
clockwise  direction,  and  let  p9 . pT- 1  be  a  similar  order¬ 

ing  of  the  reference  point*  of  Af .  By  lemma  A2  in  Appendix 
A,  R(M)  contains  an  odd  number  of  points  so  that  T  is  odd. 

Lot  K  -  ( T -  l)/2  and  let  ■  p(n/(!mod  r  for  n  -  0 . T  -  1. 

Since  K  and  T  are  relatively  prime,  the  qn  are  distinct  and 
hence  run  through  all  the  point*  of  R{M)  (see  Fig.  4),  By 
lemma  A4  In  Appendix  A,  qn  and  <7tn+nmo4r  have  unique 
separation  In  Af.  Thati*,ifx,y  GAf  andx  -y  -^+1)mudr 
-  t/n,  then  x  -  <7(n+i)mod  t  and  y  -  qn, 

Let  N  be  the  number  of  point*  in  Af,  A  reconstruction 
algorithm  for  the  mask  Af  ii  an  ordered  pair  (q,m),  for  which 
Q  •  («?d,  •  •  • ,  qN-\)  l»  an  ordering  of  th*  points  in  Af  and  m  ■ 
(mr, , . . ,  m\-i)  is  a  sequence  of  integer*  satisfying  the  foi- 
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•  # 


M2  [M2] 

Fig,  1.  Aft  la  *  mask,  Af}  l«  not  e  mask,  sines  |Af«)  has  two  parallel 
ilaw. 


algorithm!)  It  will  now  be  shown  how  thay  can  be  uied  to 
reconatruct  a  function  from  Ita  autocorrelation  function. 

Let  /  be  a  complex-valued  function  on  P  latiifying  RiM) 
s  i'if)  s  Af,  and  let  r  be  iti  autocorrelation  function,  Let  x 
-  din+umod  t  -  d n>  and  »uppoie  that  for  some  y  Q  P, 
f(y)f(y  -  <)*  1*  Oi  then,y  S  «i’(/)  s  Af  andy  -  x  E  #(D  c  Af. 
Alao.y  -  <y  -  x)  -  x  »  dtn+uimxn"*  dm  Since  fti+ innod  rand 
ft  have  unique  leparation  in  Af,  it  follow*  thaty  *  t/m+nmud  r 
and  y  -  x  ■  q„,  Therefor#  y  ■>  ftn.i>mud  r  l»  the  only  y  P 
for  which  /(y)/(y  -  x )*  ^  0;  hence 

'■.(dlH+Owad  T  “  d«)  “  /Idln+tlmud  r)fldn)*i  W) 

and  tinea  R (Af)  c  J?(/),  r(ft,4t>»odr  “  d«)  *  0*  It  n°w 
follow!  from  Eq.  (2)  that 
K 

jfj  Kffjn  “  dlln+Dmod  f) 

l/<r?o)la  ■  nh - (3) 

n^dibi+a  “<Zs«+i) 

Sinoe  /  ii  defined  by  r  only  up  to  multiplication  by  a  modulus 
1  complex  number,  we  may  require  that  /(?o)  >  0.  /(ft)  la 
then  equal  to  the  positive  aquare  root  of  the  right-hand  aide 

of  Eq.  (3).  /(ft)  can  then  be  computed  for  n  -  1 . T-l 

from  the  formula  /(ft)  *  r(ft  -  ft-i)//(ft-i)A  It  li  ihown 
in  Appendix  B  that  if  (q,  m)  ia  a  reconatruction  algorithm, 
then  the  following  program  will  compute/(ft)  for  n  •  T, . . , , 
Af  -  1.  Set/(x)  -Oforx  £  2s  and  x  ft,«  “0, . . . ,T  -  1, 
andaetn  •  T-  1. 


Step  1,  If  n  *  Af  -  1,  stop.  Otherwlae,  n  —  n  +  1. 
Step  2,  /(q„) 


i  i 


r(a*-<L.,) 


ft"  l 

s 

Ift 


/(q*)/(fl*  -  ft  +  dm,)* 


Step  3,  Go  to  step  1. 


Fig.  3.  The  circled  vertices  of  (Af]  are  l«-  reference  point!  of  the 
mask  Af, 

lowing  condition!.  The  point#  ft, ... ,  qr-i  are  aa  described 

above.  For  n  «  T . Af  -  l,  the  Integer*  m,  satisfy  the 

conditions  0  £  m„  £  T  -  1,  and  Af  H  (Af  +  ft  “  dm,)  c 

Ift . ft|  and  Af  Pi  (Af  -  ft  +  ft,)  s  Ift . q«-i|.  In  the 

next  section,  an  algorithm  for  generating  such  reconstruc¬ 
tion  algorithms  will  be  described, 
in  order  tu  justify  the  above  definition  of  reconit.  action 


8.  ALGORITHM  FOR  GENERATING 
RECONSTRUCTION  ALGORITHMS 

It  will  be  assumed  that  we  are  given  a  sequence  of  all  vertices 
ft, ... ,  vs- 1  of  Af,  where  Af  ia  a  mask  and  the  sequence  Is 
ordered  in  the  counterclockwise  direction  around  [Af], 

For  n  -  0, ....  S  -  1,  let  tn  be  the  side  of  (Af]  with 
endpoints  ft  end  t)<N+i)MPdX.  Let  (/be  the  linear  operetor  on 
!R 1  that  rotates  each  vector  in  Jda  90*  counterclockwise, 
First,  the  reference  points  po, . . . ,  pr-i  must  be  found, 


v2  «  p4 "  q2 


Fig.  4.  The  numbering*  of  the  vertices  and  reference  points  of « 
mask.  Here  S  ■  8,  T  ■  6,  end  K  ■  2. 
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Note  that  every  tide  of  [A/]  has  e  vertex  opposite  It  that  ie 
therefore  a  reference  point,  Of  course,  several  sides  may 
have  the  tame  vertex  opposite  them.  Let  u>n  ■  tWtimode  - 
vn  for  n  «  0, . .  ,  S  -  l ,  A  vertex  v„  Ie  opposite  a  aide  »n  if  and 

only  If  (i>„,  Uwn)  a  Uwn)  for  it  »0 . S-  t,  where(  ) 

denote*  the  inner  product  on  0  (tee  Fig.  S),  Thus,  by 
taking  each  tide  in  the  order  an, ....  ts-i.  all  the  reference 
points  of  M  will  be  found,  and  if  they  are  numbered  in  the 
order  in  which  they  ore  found,  pn, . . . ,  pr-i.  then  the  order¬ 
ing  will  be  in  the  counterclockwise  direction  around  (AT). 

A*  mentioned  above,  T  is  odd.  Let  A’  *■('/'-'  i )/2  and  qn  ■ 

PinKimud  r>  n  •  0 . T  -  1,  Since  each  qh  It  a  reference 

point  and  therefore  Is  a  vertex  of  [M],  there  exists  an  integer 

/t„suchthatO SknSS-  iand<jn“9k„.  Forn«0 . T- 

1,  define 

then  by  lemma  A3  In  Appendix  A,  for  x  €  M,  x  p*  qn,  and  x 
*  •fiii+iimod r,  (qn,yn)  <  (x,yn)  <  (dut+timHdny-i),  This  is 


equivalent  to  saying  that  ail  points  in  M  excluding  qh  and 
dot+umod  r  He  strictly  between  lines  perpendicular  to  yn  and 
pasting  through  t/n  and  qinp nmod  r  (*#«  Fig,  6),  (The  unique¬ 
ness  of  separation  of  qn  and  <7tn+i)im>d  r  mentioned  in  Section 
4  follows  from  this  double  inequality.) 

Let  ts«  »  <j«  4*  t?<rt+iimud  r,  and  let  ii„  -  iv„/2  for  n  -  0 . 

T-  1.  iin  is  then  the  midpoint  of  the  line  segment  Joining  qn 
and  <?(„+  nmi,d  r,  Let  D  ■  M\  R(M)  (set  difference),  and  let  0 
be  the  characteristic  function  of  D  as  a  subset  of  2,J;  l.e.,  0  ie 
the  function  on  Z1  which  it  1  on  D  and  0  outside  D,  For  x  £ 
D  and  0  i  n  &  T  -  1,  let  hn{x)  m  (x  ~  dm  >'«)i  then 
!hn(x)l/l!ynli  is  the  distance  from  x  to  the  tine  through  dH  and 
perpendicular  to  y„,  where  ll.yji  denotes  the  Euclidean  norm 
of  y„  (see  Fig,  7),  The  set  D  contains  N  »  T  polnu,  and  we 
dehne  T  orderings  of  the  points  In  D,  D  * 

n  ■  0 . T  - 1,  satisfying  IfMu  >1  it  |/iH(dfl,Hi)i 

for  k  »  0, , , , ,  S  -  T  -  2.  The  following  program  generates 

ssquences  q?, . . . ,  q^i  and  . . .  Set  n  «•  T  -  1 

and  k  ■  0  and  inter  the  following  loop: 

Step  1.  If  n  ■  IV  -  1,  stop,  Otherwise,  define 

6  *  mint/:  0  S  j  &  N  -  T  -  I  and  Md^)  -  1|, 

Step  2,  If  -  dkib)  •  1,  go  to  step  7, 

Step  3,  n*-»  +  l. 

Step  4.  Define  qn  ■  d*,t. 

Step  3.  If  hk(qn)  St  0,  define  mn  *  It.  Otherwise,  define 
■  (It  +  l)mod  T . 

Step  8,  0(qH) «-  0. 

Step  7,  k*-{k  +  l)mod  T  and  go  to  step  1, 

It  is  shown  in  Appendix  C  that  tha  loop  is  not  Infinite,  and 

if  q  m  (do . Qn- t)  and  m  ■  (mr . m,v-i)  then  l</,  m)  is  a 

rsconstruotion  algorithm, 

An  example  of  a  reconstruction  algorithm  generated  by 
the  algorithm  described  above  is  illustrated  in  Figs,  8  and  9, 
Figure  8  shows  a  mask,  a  numbering  of  Its  vertices,  and  the 
resulting  numberings  of  Its  reference  points.  Here  ■ 
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Fi|.  9.  A  rMonitruction  algorithm  (<),  m)  gdomted  for  tha  maak 
and  vertex  numbering  ahown.ln  Pit.  8,  The  numbers  undar  tha 
points  of  tha  muk  indicate  tha  ordering  g  ■  (go, , , . ,  gt»)  and  m  ■  (0, 
1, 4, 0,  l,  2, 3, 4i  0, 1,  2). 

(0, 3),  ■  (-3, 1),  Wi  m  (- 1 ,  -2),  uij  •  (l,  -2),  uit  ••  (2,  ~1), 

and  wt «  ( 1,  l).  Alao.^0  "  U(w a  -  ui0)  -  (8,  — l),y»  *  t/(ui»  - 
«»a)  -  (-3, 2),  yt  -  -  Wi)  -  (0,  -4),  ys  -  t/(u><  *  u»i)  ■ 

(2, 6),  and  y«  -  U(wn  -  u/n)  ■  (-6,  -l).  Tha  reconstruction 
algorithm  (</,  m)  that  ia  generated  la  llluatrated  In  Fig.  9. 


8.  IMPLEMENTATION 

Tha  algorithm!  preaented  In  the  laat  two  aeetiona  can  be 
implemented  with  two  computer  programa,  The  flrat  pro¬ 
gram  would  implement  tha  algorithm  in  Section  8.  Ifca  input 
would  be  a  maak,  and  its  output  would  ba  a  raconatruotlon 
algorithm.  Tho  second  program  would  implement  the  pro- 
grum  in  Section  4.  (U  Input  would  conaiat  of  a  reconstruct 
tlon  algorithm  and  on  autocorrelation  function,  and  ita  out- 
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put  would  be  the  object  function.  With  thla  arrangement,  if 
one  wiahod  to  reconatruct  many  object  funotiona  using  tha 
tame  maak,  tha  firat  program  would  have  t,o  be  run  only  once. 

7.  CONCLUSIONS 

It  haa  been  ahown  that  if  a  function  la  zero  outitde  a  given 
mask  and  ia  nonzero  at  the  reference  pointa  of  the  maak,  then 
it  la  uniquely  determined  (up  to  multiplication  by  a  complex 
number  with  modulua  1)  by  ita  autocorrelation  function 
among  ail  other  object  functione  that  are  zero  outside  the 
maak,  (A  maak  ia  a  eat  of  pointa  In  tha  dlacrete  lattica  whole 
convex  hull  haa  no  parallel  aidti.)  Moreover,  there  la  an 
algorithm  for  generating  raconatruotlon  algorithm!  for  any 
given  maak,  whioh  in  turn  can  be  uaed  to  reconatruct  object 
functione  satisfying  the  above-mentioned  condition!  from 
thair  autocorrelation  funotiona. 

Thia  theory  had  some  similarity  to  holography.  l0'u  How* 
aver,  leveral  (at  least  thraa)  rafaranca  pointa  ara  uaad  hara, 
whartaa  only  tma  rafaranca  point  ia  naadad  in  tha  holograph¬ 
ic  altuation.  On  tha  othar  hand,  tha  holographic  raferance 
point  muat  be  laoiated  from  the  raat  of  tha  object,  wharaaa  no 
aueh  laolation  of  the  reference  pointa  ia  required  here,  It  ia 
intereating  to  epeculate  whether  there  might  be  a  more  gen¬ 
eral  theory  of  which  thia  theory  and  holography  would  both 
ba  apaciai  case*. 

APPENDIX  A 

Let  S,T,o*  for  »»»<>, ,.,,S-  1  and  </„  for  n  •  0, . ,  ,  ,T-lbe 
daflnad  aa  in  Saotion  4.  Tharefore  all  the  qH'%  referred  to  In 
thia  appsndii  ara  rafaranca  pointa.  Alao,  lat  U,  a*,  and  i o* 
for  n  ■  0, . . .  ,8  - ).  and  hH  andy*  for  n  »  0, . , , ,  T  -  1  ba  de¬ 
fined  aa  in  Section  6.  Lat  t„  be  tha  aide  of  [R(M)\  with 
endpoints  p,  and  P(H+ummi  r  (»••  Pig.  10),  and  lat  u„  « 

Pla+oate  r  “  P*  for  "  ■  0 . T  -  l.  Wa  not*  for  future 

rafaranca  that  for  u,mG  Jf*.  (u,  Uu)  »  0.  Wb  «*  -ti,  {Uu, 
Uw)  ■<«/,  w),and  (t>,  Uu>)  -  {Uu,  lPu>)  •-{Uu,ui)l  Tha 
proof  of  tha  uniquenaaa  theorem  in  Section  3  required  a 
aeries  of  lemmas. 

Lemma  A1 

R(M)  ia  a  maak  and  R{R(M))  -  R(M)> 

Proof 

Suppoae  it  can  ba  ahown  that  every  vertex  of  [/f(A#l]  ia 
opposite  aome  aide  of  [A(A0|.  Slnca  at  moat  one  vertex  oan 
be  oppoaite  a  given  aide  and  the  number  of  vertlcea  equals 


Fig,  10  Tha  sat  |rt(M)|  li  the  convex  polygon  with  xidei  t„  i  - 

0 . 4. 

238 


l‘J8  •).  Opt.  Sue  Am.  A/Vol,  4.  No.  l/Jsnuary  108” 


Thomas  U,  Crlmmlns 


the  number  of  aides.  It  would  then  follow  that  every  tide 
must  have  a  vertex  opposite  It.  and  therefore  no  two  sides 
van  be  parallel;  hence  fi(A/)  Is  u  mask.  Also,  since  every 
vertex  is  opposite  u  side.  filfilAH)  la  the  set  of  all  vertices  of 
(fi(Af) |,  which  is  equal  to  the  set  fl(A/) ;  hence  fi(RIAf))  * 
fit  A/).  Thus  It  suffices  to  show  that  every  vortex  of  |fi(  Af  )|  Is 
opposite  some  side  of  |fi(A/)|, 

The  vertices  of  (fit  A/)  |  are  pm,nt  *0 . T-  1.  Let  m  be 

fixed  but  arbitrary,  0  S  m  ST-  l;thenp,M  la  also  a  vertex  of 
(A/],  and  hence  pm  ■  c*  for  some  k ,  Let  u„  be  the  vertex  of 
j  A/j  opposite  side  s  of  (Af|,  and  let  c/,  be  the  vertex  of 

l  A/j  opposite  side  **  of  [Afji  then  v*  and  ut,  are  In  fi(Af),  and  u„ 
-  ph  for  some  n,  (Refer  to  Fig,  10  and  take  m  »  0.  In  this 
case  k  *  4,  a  ■  7,  b  **  l  and  n  ■  2.)  If  ua  and  ut,  ere  the  same 
vertex,  then  there  can  be  no  side  of  (Afj  opposite  us,  but  u*  * 
Pm  6  fi(Af),  and  so  v*  must  be  opposite  some  side  of  [A/J, 
Therefore  a*  *  u*.  It  then  follows  that  tn,  -  Pi«+iimbd  f-  It 
will  be  shown  that  pm  Is  opposite  aide  (*  of  (fi(  Af)|,  That  la, 
m  wish  to  show  that  for  0  £  j  si  T  -  1  and  j  H 

(ty.  t/uM>  <  (pm,  l/u*>,  (All 

Since  ut  Is  opposite  side  of  [Af],  it  follows  that 

for  o  d  I  £  s  -  l  and  i  sf  a. 

<n*  (A2) 

and  since  p»  la  oppoalta  iv  for  0  S  i  A  S  - 1  and  i  e»  b, 

-ti*>  <0.  (A3) 

By  setting  i «  ft  in  expression  (A2),  we  obtain  <t/uiu„,ur»uri,Vi 
os-u#)  <0,  and  thus 

<ul(r-llnuxl.r“Pm<  Um„)  *  “  uk>  WPmUmiT 

<  0,  (A4) 

By  setting  i  ■  a  in  expreeelon  (A3),  we  obtain  (Uw^  uu  -  vj,) 
<  0,  and  thus 

l^iS+nmui.V  “  Pm<  Uu„)  m  U'l.VHImulil  “  yS*  WPimUmAT  “Pit)) 
■  (to*,  LUv b  -oa)) 

*  (  Uwh,  v„  «  i*k> 

<  0,  (A5) 

Since  ui*_i  imtHj  h,  v»(  *  pm)  and  Ui*+ nniwi  s are  dieting  verticea 
of  (A/),  the  veotors  fie- > im„a .s  ~  P«,  and  VuH>N*<Nia  “  Pm  are 
linearly  independent,  Let  p  i~  fi( Af>,  p  *  u^\\mais>  Pm* 
vu,+lmwj  #,  Thvre  then  exist,  real  numbers  «  and  li ,  such  that 

P  -  Pm  "  ««'u-lleiedN  ~  Pm>  +  HIUUHlewda  “  Pm)-  <AB) 

Also,  since  (u*,  f/ic*)  <  <p,  t/w*), 

0  <  (p  -  0*,  l/Wj,) 

"  (p  “  Pm'  U!yq,+  llmudS  “"Pm)) 
u  ‘At'U-llmudJI  ”  P«>  l^ul*+llnux|S  “  Pm)) 

T  d(V(*+i|tn()<j4|  ~  pffl I  +  S  ”  Pm) ) 


"  ,*i  “  P m1  ^  *L’ifc+luniH|.w  “  P„|)) 

■  -rttWis.iimudN,  ^it'ie+niMisix  “  t'v) ) 

"  **<  C’,*  +- 1  i,mH|  N  “  l'*)>  (Ai) 

Since  (L'utu-nwiidXi  I'u+nmori.-.-  “  L'k)  >  0,  It  follows  from 
expression  (A7)  that  o  >  0,  Similarly, 

0  <  (P  “  P*i  |  IHItkl  .S'  > 

"  (P  “  Pmi  f'lPiti  —  t'ik~|lm ,«((,-)> 

“  «(l\*-iiHtuSS  “  Pm<  *"  iA*-iihuiiJ.S‘)) 

■h  d(P|HI)irti>dS  “  Pm*  tAp fli  “  On, 

-  “rt(Pm  “  Pis-H«wt,Si  ^IPm  “  Uhuui 
+  d(n(*+|(m«dy“  P«.  (Apm“  t'u,  - 1 1  mod  if )  ^ 

*  d(Pi**liHMdjj  “  Pm*  i-^IPm  “  PiS-limud.s)) 

“  ^^UflHswla  ”  P**  ^%-litmxlti)'  lAd) 

Since  (y(*+i)miie.s  “  ps.  L/to,*- umod >  0,  It  follow*  from 
expression  (AB)  that  d  >  0,  Using  expansions  (A4),  (Ail), 
and  (AB)  and  the  fact  that  u  >  0  and  ill  >  0,  we  have,  for  p  * 
^IhrllMudSi  Pmi  LhS-t  I  anuria, 

iP  —  Pm*  Uurt)  “  w(t/n*|inluui-  "*  Pm ,  Vuh) 

+  d(P(t+|(inudif  “Pm*  Uu„) 

<0.  (AB) 

Inequality  (Al)  now  follow*  from  expressions  (A4),  (Ail,  and 

(A0),  Tnle  oompletev  the  proof  of  lemma  At, 

Iritmma  Aa 

The  number  f  of  points  Infi(Af)  Is  odd,  and  If  K  »  (T-  U/2 
and  0  ai )  &  T  -  1  then  p,,  us  u  vertex  of  (fi(Af)|,  Is  opposite 
eidetiK+omudrof  |fi(Af)j, 

Proof 

It  follows  from  lemma  AI  thut  every  side  of  |fi(Af)|  has 
exactly  ono  vertex  of  (fi(A/)|  opposite  It  and  every  vertex;  l.#„ 
every  point  In  R(M)>  la  oppoaite  exactly  one  side.  Thu* 
thnre  la  a  pooltive  integer  K  jS  T  -  2,  such  thut  pk*-i  Is 
opposite  side  t0,  p;  is  then  opposite  side  t**;,  and  m  Is 
opposite  side  ((x>-Jimnri  r*  More  goimrally,  p,  is  oppoaite  rid* 
)iK+;imuri  p  Setting  j  •  T  -  K,  we  find  thut  pr-H  i»  opposite 
to-  However,  pk+i  i»  opporlte  to-  Therefore  T  -  K  ■  K  +  l 
or  T  «  2 K  +  1.  Thl*  complete*  the  proof  of  lemma  A2. 

Lemma  A3 

For  x  6  [A/],  X  *  q„  Vi/e-iimuii  Ti  j  "  0 .  T  -  1, 

*V/ti'j)  ^  (<*y,i)  4  (Vq+iimiiri '/'*  y;)- 

Proof 

It  aufflcee  to  show  that  the  inequalities  hold  for  all  vertices  v 
of  |A/|,  o  *s  q,,  t/(/+umuri  Let;  be  fixed  but  arbitrary.  For 
convenience  let  in  -  (k,  -  Drnod  S  and  n  »  k0+i,m(Kl ,-;  then 
y,  -  UtCm  -  Uio„. 

First  we  will  ehow  that  Pq+umodr.  as  a  vertex  of  (Af|,  is 
opposite  side  h„  of  [A/J.  Let  u„  be  the  vertex  of  (A/J  opposite 
side  i„i  of  (A/J,  and  let  Vh  be  opposite  »im+iim.«i.s'i  then  i*„  und 
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Vh are  in  and  u„  •  px  for  some  k,  (Refer  to  Fig,  10  and 

take/  -  4|  then kt  *•  1,  m  *  0,  (j  4  i)mod  T *  0, n  «  kn  •»  4,  a 
“  4,  b  •  d,  and  k  -  0,|  By  the  argument  In  the  proof  of 
lemma  Al.  t'j,  *  Pia+i mimi  >\  and  t1  (*  Vy  *  t onud >s‘ ) ■ 

ai  a  vertex  uf  (ft(,V/)|,  ti  opposite  aide  t*  of  (ft(Aft|.  By 
lemma  A2,  PijKmoi  r  I*  opposite  fvmK  mw|  y,  Hern:#,  by  ism- 
mu  Al,  I*  •  tp+utvMud  r  and  k  -  ()  4  UK  mod  T,  Thua 
doMimud  r  “  PuMiKmmt  r  “  Ps  “  o,„  end  therefore  (Ju+uwdH  r. 
a*  a  vertex  of  [A/|,  ia  opposite  sides*  of  (  Vf| 

By  u  similar  argument  it  can  be  shown  that  qlt  aa  a  vertex 
of  ( Af),  la  opposite  aide  s„  of  (A/). 

Since  du+iiwud  r  l»  oppoalte  tm  and  v  *  qo+iimodfi  it  fol¬ 
lows  that  f*  '*■*  Us  Liu)*)  or  (du+iitsttaf  “  o, 

LW  >  0,  Also,  sines  c  t~  (Wj,  U'  *  I'm  (Mt  I  i  n,  There¬ 
fore 

"  Wo+tmuKt  r-t'.y^ 

*  <V(j4.uWtttj  r  - 1».  -  t/uirt) 

“  ^ueiWtedr-^Wlfii)  “<du+ll*etirM‘a-,  l/w,) 

"  <do*timua  r  “  w*  lAcJ  “  <c„  -  u, 

*  Wy+Umixt  r  “  l,i  -bio  -  0H,  t./w„> 

>  0,  (A10) 

Since  q,  ia  oppoalte  s„  and  v  ti  q„  <q/,  t/uij  >  <u.  Uwn)  or 
(q;  -  t>»  Uwh)  >  0,  Alto,  since  v  t~  (Af),  (P  -  oiMtdmitiX' 
lJwn )  &  0,  Therefore 

U't.Vy)  (d y..Vy)  *  (P  “tly,,Vy) 

•  <l>  “  <fy,  ~  (AC*) 

•  <u-q/(  Ui i»m>  4-  <qy-ti,  (M.) 

-  <c-uv(Avm>  4  <q;-e,  C/urN> 

•  <W  “  «t»wnm«ty.  +  <1j  -  u.  t/tv,) 

>0,  (All) 

It  now  followe  from  expreaeiona  ( A10>  end  (Ail)  that  (q>, 

>‘i)  <  (v,yy>  <  (du+timwi  r,  >>)•  This  completes  the  proof  of 
lemma  Al). 

In  the  remainder  of  this  appendix,  all  modulo  arithmetic 
will  be  mod  T.  For  convenience,  we  define  m  •  n  ■ 

(m  4>  nlmod  T, 

The  next  lemma  asserts  that  qt  and  q/«i  have  unique 
separation  in  AY, 

Lemma  A4 

For  0  sS  >  5  T  -  l,  If  xt,  xa  €~  M  and  x \  -  xj  -  q,*\  -  q>,  then 
xi  ■  q;«i  and  Xu  -  qt, 

Proof 

II  either  xi  ti  g,*,  orxy  ti  q,,  then  it  follows  from  lemma  Al) 
that 

U,  -  i.j.v,)  -  < x | ,  yt)  -  ,y;> 

<  <</y«i.^,)  -  (d/.yy) 

•  <d,«t  -<*,’>,)<  (A12) 
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which  contradicts  the  assumption  that  -  x<t  ■  qy*i  “qy, 
Therefore  x  i  **  d/eiantlxy  **  q;,  This  completes  the  proof  of 
lemma  A4, 

Let  g  and  h  be  complex-valued  functions  on  Z\  and  let  q  • 
h  denote  the  convolution  of  g  and  hi  that  le,  If  f  «*  g  ♦  h,  then 

fix.)  *  V  ,v(ui/t(x  -  u),  (Alb) 


We  define  A’(g)  +  .V(h)  •*  j.t  4-  >:  x  G  A’(g)  and  y  S  4’(h)|, 
The  following  lemrne  is  fundamental, 

Lemma  AS 

tf/ ■#*/»,  then  *  (A‘(g)  +  »m)|, 

Proof 

It  follows  from  Bq,  (AID)  that  l'(f)  a  i'(g)  4-  «1'(A>!  hence 
|4'(/)|  ti  [41(g)  4  l(h)j,  It  remains  to  be  shown  that  U'(g)  + 
4'Odi  u  [mm 

Let  x  be  a  vertex  of  (41(g)  4  j*(h)||  then  there  exists  u  y  $ 
ft*  such  that  for  *'  e  i'(|)  4  m  and  x'  *  x. 

<*', y>  <  <x(y>,  (A14) 

Also,  since  x  ia  a  vartax  of  (41(g)  4  it  follows  that  x  £ 
4(g)  4  Jf(h),  and  hence  there  exist  *i  6  4*(g'  and  x<i  &  4‘(h) 
such  that  r,  •  xt  4  xg,  We  will  show  that  this  decumpoaltlon 
of  x  is  unique,  Suppose  that  x  »  x/  4  xa'  with  xi'  Q  A'(g) 
and  x4'  £“  4“(h)i  then 

(*i,y>  4  (t;,,y>  a*  (Xi  4*a,y> 

■  (*>  y) 

-  (Xj1  4  Xg't  y ) 

,V>  4  <xa',,v),  (A  16) 

Therefore  either  <xt\  y>  &  (X|,.v)  or  (xuy)  £  (x%y)  or 
both,  Suppose  thst  (X|\  y>  Jt  (*i,  >),  Let  x1  •  X|'  4  xy| 
then  x'  6  41(g)  4  4'(h)  end 

t 

<x',y>  "  <*/,>)  4  (xu,y> 

st  <*i,y)  +  <*u.y) 

*  <x,y>.  ( A 16) 

Therefore,  by  Inequality  (AH),  x'  ■  x,  which  implies  that  xr 
■  X|  and  hence  *-/  »  xa,  If  (x/,  y)  2;  <x2,  y>,  a  similar 
argument  leuds  to  the  same  conclusion.  Therefore  the  da- 
oompooition  x  *  X|  4  x«  with  xt  <~  4'(g)  and  xy  G  4'(/i) 
is  unique.  Suppose,  for  a  particular  uu  6  Za,  that 
g(uo)h(x  -  mi)  ti  0:  than  uH  f~  A’(g),  x  -  u0  G  a’(/t)  and  x  -  u0 
4  (x  -  lot),  By  the  unlquenoos  of  the  decomposition  of  x  it 
followe  that  u»  *  xt  and  x  -  Uu  *  x»,  Therefore  fix)  • 
g(x|)A(sey)  ti  Oundx  4’(/'),  Since  a  was  an  arbitrary  vertex 
of  (A’(g)  4  g(A)|,  it  followe  thet  all  the  vertices  of  (4'(g)  + 
4’(/i)|  are  in  4 t(f),  and  therefore  (4(g)  4  4'(h)|  «  (,!'(/) j.  Thin 
completes  the  proof  of  lentmu  A5, 

W«  are  now  ready  to  prove  the  theorem, 

Proof  ofThaorem 

Since  r  ■  ri,  It  follows  from  the  results  of  Bruok  and  Bodin'1 
that  there  exisit  functions#  and  h  with  finite  supports  and  a 
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vector  d  6  U'1  such  that  /  •  g  »  H  und  /i(x)  •  g  •  /i|(jc  “  o'), 
where  /i|(x)  *  /H  -,v>*  for  .i  £  2,J. 

We  have  /f(A/)  a  >W f)  a  ,4’(/j)  +  A’(/i).  Therefore  there 

ex  let  . . tip- 1 1  4’Utl  ahd  6o . bf-\  £  A’l/i)  such  that 

y, -« ,  +  />,,  j*  0 . T  “  t.  (M7) 

Now  let;  be  fixed  but  arbitrary,  and  let  x  6  AM/Th  *  e*  dy, 

We  will  show  that  (d„  >■/  <  (x,  >■;),  Suppose,  to  the  con¬ 
trary,  thut  (x,  v,)  &  <d,,.v,),  Let  x'  “  x  4-  6, i  then,  by  using 
lemma  AD.  x'  £  A*0i>  4-  A’()i)  <5  |A’(y)  ¥  !Hh)\  *  (AM^l  s  [M), 
Ai,.,i, 

(x'.y,)  -  (x,yy>  +  <6/(  .Vj) 

S  +  (fyy,) 

<AW 

It  now  follows  from  lemma  Ad  that  x'  “  which  Implies 
that  x  *  a,,  contradiction  the  assumption  that  x  s  tty. 
Therefore  (a, uv,)  <  <xt>,/>,  By  a  almllar  argument  It  cun  be 
shown  that  If  x  £  #{g)  end  x  a  a,*i,then  <x,y;)  <<d/#i,yy), 
Thue,  if  x  £  arid  x  A  ay,  «y«i,  then 

(flyi.Vy)  <  <x,>‘y>  <  ‘ttyei.yy).  (A19) 

Alao,  by  similar  arguments,  It  can  ba  shown  that  If  it  £  A‘(h) 
and  x  a  6i,  by.i.than 

<f»yi.Vy>  <  <X,yy)  <  <6ymi.V/>.  <A20) 

Lat  A'lg)  -  A'(/»)  » |x  “  yi  x  t»  4*(g )  arid  y  £  Since 
«V(hi)  •»  — Jr'fAt),  4*  4'(h|> « *vcar)  -  #(H).  Also,  slnca/ifx) 

•  g  *  A|(x  d),  It  follnwe  from  tsmma  M  that 

(*'(/,)!  “  (4’(if)  -  4*(h»  +  d.  (AiU) 

Wa  will  show  that 

fly  -  J>li(  +  d  e  M,  <A2B> 

We  have  u,  -  b,»,  £  .Vf/O  -  4?(/t),  Let  J  ba  fixed  but 
arbitrary,  and  let  x  6  A'lgi  -  A’vh),  x  *s  <>,  *  <iy#i,  Than 
thar#  exist  xt  £  A'Ur)  and  xu  €  A’(h)  such  that  x  *  X(  «  xj. 
Since  x  k*  a,  -  h,#i,  either  X|  H  orx,;,  a  blt \  or  both,  In 
any  case  It  follows  from  expressions  (Alb)  and  (ABU)  that 

(*.>',)  "  \*|,yj)  -  V 
>  <u;,y;>  -  y;) 

*  <u,  -  6^|,y;>.  (ABU) 

Therefore  n,  -  h,*i  b  a  vertex  of  |.Vlg)  -  #(A)|,  anti  by  Eq. 
(A21),U;  - /)/«i  +•</ is  ii  vertex  of|.V(/|)|,  Therefore  u;  -  6(#i 
+  d  t»»  4M/|)  m  M,  mid  relation  (Ail'd)  follows, 

Uy  a  similar  uruumeni  it  can  be  shown  that 

U/#i  ”  b,  +  d  ("  M,  (A 24) 

Now 

,ui  "  b'0i  +  o' I  -  (u^|  -  4*  d) 

-(6l  +  6l)-(aJ#l4"6/Wl 
"  y,  -  y,*o  tAdfl) 


b,  +  bimi*d,  (A  11(1)  j 

Prom  h,  4*  />,« i  ••  d  and  ¥b^  *  d  we  obtain  bt  • 

Since  f  Is  odd  by  lemma  A2,  It  now  follow*  that  b,)  ■  bt 
-  , , ,  “  and  by  usln«  Eq,  (A20)  we  obtain 

f»4)  **•  f>,  -  -  6t,„,  «d/2,  (A27) 

from  expression  (A20)  and  Eq,  (A27)  we  obtain  AHh)  *  |d/2|. 
Therefore,  for  x  £ 

/(x)  -  g  *  hlx) 

*  /t(d/2)#(x  -  d/2),  (AM) 

If  /i(d/U)  *»  0,  then  f  would  ba  Identically  aero,  contradicting 
the  assumption  that  R(M)  s  .)*(/),  Therefore  Hid/ 2)  A  0, 
Porx  e  2IJ, 

/i(x)  *  hi(x  “  d) 

'  H(dj2)*g[x  -  d  4-  d/2) 

-A(d/2)*g(x  -  rf/'J) 

“  u/(x),  (ABO) 

where 

_  hld/2)* 

“  ‘Him' 

Sinca  iul  •**  1,  thin  completes  the  proof  of  the  theorem. 

APPENDIX  h 

U  will  be  shown  In  this  appendix  that  the  program  presented 
at  the  end  of  Section  4  computea  /( qn)  for  n  a  t, , , , ,  N  -  l, 

It  will  be  assumed  that  /(</«)  for  «*»(), . V  **  1  hoe  been 

computed  oa  described  In  Section  4,  Sllno*  0  S  m,  a  T  -  i, 
f{mn)  has  been  computed  for  n  •  T, . , . ,  IV  1. 

For  T  &  n  i  N  -  l  we  havj 

'*  </m.)  “  V  /(y'.f(y  **  C„  +  7mJ’  (Bi  t 
ysuJf* 

If y  6*  *V(0  ahtl  >  -  (/„  +  €  ■f(/')>  then,  since  AHf)  w  Af,  It 

follows  ihety  6  Af  and  y  *  +  i/Mh  W  nr,  equivalently,  y 

£M  +  d«  “  Pm,'  Thereforu 

,v  G  M  n  (M  4-  </„  -  Vw,>  «  l<j, . (/„|,  (Hi!) 

Hence 

/I 

-</„,)  -V/WV* 


fl 


*•  /(')„'/(</, „,)*  +  y  HdhUittk  ~  <f„ + '/«,„)*■ 

Ki 

i  mu 


Thus 


/(«/„)  " 


/«/„,)* 


ii -I 


'■((/„  -  dm,)  -  y  /Wi^/Wi,  -  4-  </„,„)• 


j 

ib4) 


Ly  lemmu  A4,  n,  -  /»,#i  4 •  d  •  •  a,  4-  6(,  from  which  wa 

obtain 
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An  induction  argument  will  be  used  to  prove  that  /(<jrt)  is 

computed  correctly  for  n  »  T . IV  -  l,  The  induction 

hypothesis  is 

H(n):  /(</;)  is  computed  correctly  forO  £  j  £  n.  i B5) 

By  the  derivation  in  Section  4,  Ii(T  -  ,  ,s  true.  Now 
assume  that  TSnSN-  l  and  H(n  -  1)  is  true.  We  want  to 
show  that  H(n)  is  true.  It  suffices  to  show  that  f(qn)  i9 
computed  correctly.  Let  all  variables  have  the  values  that 
they  have  at  step  2  of  the  pass  through  the  loop  in  which  f(qn) 
is  computed.  It  must  be  shown  that  all  values  of  {  appearing 
in  the  right-hand  side  of  Eq.  (B4)  are  correct.  Since,  by 
assumption,  H(n-l)  is  true,  it  follows  that  f(qk),  k  -  0, . . . , 
n  -  1  have  the  correct  values.  Also,  as  mentioned  above, 
fiqm„)  has  the  correct  value.  Let  x  «■<?*-  +  qm„  with  Oik 

<  n  -  1.  If  x  4  M,  then  f(x)  ■  0.  In  this  case,  since  8(f)  s 
M,  it  follows  that  x  4  8(f),  and  therefore  0  is  the  correct 
value  for /(x).  Assume  that  x  £  M.  We  have  x +  " 

<?*  £  M  and  therefore  x  £  M  -  qn  4-  q„n.  Hence  r  £MH 
(M  -  qn  +  <jm„)  s  too.  •  •  • .  It  now  follows  from  the 
induction  hypothesis,  H(n  -  1),  that  f(x)  has  the  correct 
value  when  the  value  for  f(qn)  is  computed.  Therefore  /(<?„) 
is  computed  correctly,  and  H(n)  is  true.  By  Induction  it 
follows  that  H(N  -  1)  is  true  and  hence  fin)  is  computed 
correctly  for  n  *  0, ....  N  -  1. 

APPENDIX  C 

In  this  appendix  it  will  be  shown  that  the  program  In  Section 
5  generates  a  reconstruction  algorithm.  First,  it  will  be 
shown  that  the  loop  is  not  infinite,  and  hence  the  program 
produces  sequences  qr . qn-i  and  mr, ....  m/v-i.  Sec¬ 

ond,  it  will  be  shown  that  if  q  ■  (<?o>  • . . ,  qy-i),  and  m  ■ 
(rar,...,my.|),  then  (q,  in)  is  a  reconstruction  algorithm. 

In  what  follows,  0  will  be  used  to  denote  both  the  number 
zero  and  the  origin  of  ft'2.  Context  should  prevent  any 
confusion. 

If  Jt.y.zS  ft2,  let  [x,  y,  z]  denote  their  convex  hull  in  ft2, 
If  x,  y  and  z  are  noncollinear,  then  the  interior  of  [x,  y,  z]  is 
given  by 

int(x,y,  z|  » |ax  +  by  +  ex:  a,  6,  c  >  0  and  a  +  b  +  c  *  11;  (Cl) 

then  0  £  int(x,  y,  *1  if  and  only  if  x,  y,  and  z  are  noncollinear 
and  there  exist  strictly  positive  numbers  a,  b,  c  such  that 
ax  +  by  +■  cz  ■  0.  (The  sum  of  a,  b,  and  c  can  be  rap.de  equal 
to  1  by  dividing  each  of  these  numbers  by  their  sum  if 
necessary.) 

The  following  lemma  will  be  needed. 

Lemma  Cl 

If  Mm  fn  £  ft2,  n  »  1,2,  3,  (fin,  v„)  >  0,  and  (Mm  I'm)  <  0  for 
n  id  m,  then  0  £  int[Mi,  M2,  ms|  and  0  £  inttoi,  m2,  mu]. 

Proof 

By  symmetry  it  suffices  to  show  that  0  £  int[Mi,  M2,  m»1- 
First,  we  will  show  that  mi,  M2>  and  ms  are  noncollinear.  If 
they  were  collinear,  then  one  of  them  would  be  in  the  convex 
hull  of  the  other  two.  If  mi  were  in  the  convex  hull  of  m2  and 
m.i,  then  there  would  exist  numbers  r2  and  such  that  r2, 
r:t  >  0;  t j  +  T-. )  “  lj  and  mi  “  r2M2  +  tjms-  In  that  case, 
however,  (mi,  mi)  “  (r^a  +  r:iM:i.  n>  ■  t2(ms,  mi)  +  r;J < M:i,  mi) 


<  0,  contradicting  the  assumption  that  (mi,  i'i)  >0.  There¬ 
fore  mi,  M2,  and  m:i  are  noncollinear. 

Since  any  three  vectors  in  ft2  are  linearly  dependent,  there 
exist  three  numbers  and  <ra,  not  all  zero,  such  that 

<T|M|  +  i r.jM'j  +  cr:iM:i  ■  0.  (C2) 

Since  (m«,  m„)  >  0,  Mn  r*  0,  n  ■  I,  2,  3,  Therefore  no  two  of 
the  a„  can  be  zero. 

Now  suppose  that  *  0;  then  cr2  A  0  *  <r;i,  and  it  follows 


from  Eq,  (C2)  that 

M2  **  ~(<rj/ '72)Mui 

(C3) 

hence 

0  <  <m2,  m2)  -  -  Win,)  (Mu,  m2), 

(C4) 

Since  (ms,  M2)  <  0,  it  follows  from  expression  (C4)  that 

-ar3/<r2  <  0. 

(Cfi) 

Also, 

0>  (m2,  Mi>  -  -(crg/cjXMai  n), 

(Cfl) 

and  since  (ms,  mi)  <0,  it  follows  from  expression  (C6)  that 
-O'Jaz  >  0,  which  contradicts  expression  (C5).  Therefore 
ci  ^  o,  and  by  symmetry,  era  *  0  r*  <r2.  By  multiplying  the 
<rn's  by  - 1  if  necessary,  we  may  assume  that  <ti  >  0,  We  have 

i < M 1 1  )  +  a2(M2.  "i>  +  ^(Ma^) 

-  < n'iMi  +  <r2M2  +  "iiMii,  Mi  > 

-  0.  (C7> 

Since  a i  >  0  and  (mi,  »i)  >  0, 

r2(M2,  +  «r0<M3.  Mt>  •  -e^Mi.Mi)  <0.  (C8) 

Since  (m2,  mi>  <0and  (ms, mi)  <0,  at  least  one  of  the  numbers 
ir2  and  a2  must  be  strictly  positive.  By  symmetry,  we  may 
assume  without  loss  of  generality  that  ir2  >  0.  Now 

*1<M|,  +  c2(m2,  m2)  +  (t3(m2,  m2) 

■  (c,M|  +  e2Ma  +  0;lM;|i  M;,) 

-  0.  (C9) 

Since  o\  >  0,  <r2  >  0,  (mi,  m,i>  <  0,  and  (m2,  m2)  <  0,  it  follows 
that 

"3>  *  ~ *r i  <Mi»  M3>  -  <t2(m2,  Mn) 

>0,  (CIO) 

Since  (mu,  m2)  >  0,  it  follows  that  a g  >  0.  By  the  comment 
preceding  the  lemma,  it  now  follows  that  0  £  int(Mi,  mj,  Mu). 
This  completes  the  proof  of  lemma  Cl. 

One  more  lemma  is  needed  before  proving  that  the  loop  is 
not  infinite.  As  in  Appendix  A,  we  define  j  ©  k  - 
(j  +  6)  mod  T. 

Lemma  C2 

For  j  ■  0, . . . ,  T  -  1  and  A  - 2 . T-1.0E  int|y,,  »i, 

Proof 

The  proof  will  be  by  induction  on  A.  Let  k  "2.  We  want  to 
show  that  0  £  int[y,,  y;#1,  y;,2).  Let 
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Pi  m  Mi  ~  Ms*  P‘i  *  M*  “  Mi*  Pa  ■  Qj  “  Ms* 

Kj-yy.i,  »amy y«*  (Cll) 

By  lemma  A3,  (ju,„  m„)  >  0  and  <m„,  fm)  <  0  for  n  r*  m. 
Therefore,  by  lemma  Cl,  0  £  int(ai,  i>2,  a3|  *  int(yy,  y;*i, 
Mai- 

Let  3  £  k  £  T  -  1,  and  assume  that  the  lemma  is  true  for  k 
-  1.  We  want  toshow  thatO  g  Intlyy.Mi,  We 

have  shown  that  0  £  intfyj,  y^i,  Mai*  anc*  therefore  there 
exist  strictly  positive  numbers  </i,  <to,  <n  such  that 

o +  ^jei  +  etfjM  -  0.  (C12) 

Applying  the  lemma  for  k  -  1  with  j  replaced  by  j  ®  1,  we 
have  0  £  int[y/#i,  m*  and  therefore  there 

exist  strictly  positive  numbers  ri,  rj,  ra  such  that 

TM i  +  r2 +  ra(— l)*“Vy«A  “  0.  (C13) 

Bv  multiplying  Eq.  (C13)  by  and  subtracting  the  result 
from  Eq.  (Cl 2).  we  obtain 

A,y;  +  +  Ajf-DV;.*  -  0,  (C14) 

where  Xi  •  o\,  Xj  •  <rj  -  ap'\tr%  and  Xa  •»  <rjra/ra.  We  have 
X|>0andXa>0.  We  will  show  that  X2  >0.  FromEq,  (C14) 
we  obtain 

V/.i  “  -V/  +  (Ci5) 

We  consider  two  cases, 

Case  1 

h  is  odd;  then  k  -  1  is  even,  and,  by  Eq.  (C15), 

*aMt  “  “*t yj  +  (Cl6) 

and,  by  using  lemma  A3, 

MMAel  “  Ml'Ml^ 

*  ■“^i((?/e*ei  “  Qj+v  .V/)  +  y;#*) 

“  Ai  (Mi  "  M*e t*  >/)  ^  ~  Mi'  M*) 

>0.  (C17) 

Since,  by  lemma  A3,  (<m*« i  “  <7/ei*  y/ei )  >  0,  it  follows  from 
expression  (C17)  that  X2  >  0. 

Case  2 

k  is  even;  then  k  -  1  is  odd,  and,  by  Eq.  (CIS), 

'X'jy/ei  “  -hy,  ~  V,**!  (C18) 

and,  by  using  lemma  A3, 

^(M*  ~  M i>  y/e  1 1  m  ^  1  ( 1  —  M*>^ 

+  M?;*l  “<?/•*' M*> 

>  0.  (C19) 

Since,  by  lemma  A3,  (q;#*  -  q/al,  y;*|)  >  0,  it  follows  from 
expression  (C19)  that  X3  >  0, 

It  remains  to  be  shown  that  y,,  y,*i,  and  (-l)*yy«*  are 
noncollinear,  Since  X|,  X3,  X;t  >  0,  it  follows  from  Eq.  (C14) 
that  0  £  [y,,  y;„,  Therefore,  if  y,,  yj9i,  and 

( ~  1  )*>;•*■  are  collinear,  then  they  must  all  lie  on  a  line 


through  the  origin,  However,  since  we  have  already  shown 
that  0  £  int[y/,  Mi>  Mil*  >7  a*»d  Mi  cannot  lie  on  a  line 
through  the  origin.  Therefore  yJt  yy#i,  and  (-l)*y/e*  are 
noncollinear,  and  hence  0  £  int[>»;,  y;* ,,  (-l)*y;**].  This 
completes  that  proof  of  lemma  C2. 

In  order  to  prove  that  the  loop  is  not  infinite,  It  will  be 
shown  that  the  parameter  n  in  the  program  in  Section  5  can 
fail  to  be  incremented  on  at  most  T  -  2  consecutive  passes 
through  the  loop.  The  proof  will  be  by  contradiction.  Ac- 


cordingly,  assume  that  n  is  not  incremented  on  T  —  1  consec¬ 
utive  passes  through  the  loop. 

Let  k  and  <t>  have  the  values  that  they  have  at  step  2  of  the 
first  of  these  T  -  1  pastes.  Let 

ba  -  mini;;  OSySAf-T-l  and  0{dk9aj)  - 

i|  (C20) 

for  a  -  0 . T  -  2;  then 

Mea. bj  -  1 

(C21) 

and 

*(«*•<!  -  d*ea,ti.>  “  1 

(C22) 

for  a  ■  0, ....  7*  -  2.  Let 

Xa  m  fdk**,b’  if  *  0 

~  otherwise 

(C23) 

We  have 

A*.a(xa)  -  lA*,0(tf*.,i4,)|. 

(C24) 

if  x  £  2a  and  <t>(x)  -  1,  then  x  £  D  and  x  «  d*.0<, 
for  some  j  £  ba.  Therefore  l/»**0(*)l  “  lh*#a(dt*aj)l 
£  lh*#0(d*.ats,)l  -  hkmii(xa).  Thus,  for  x  £  Z\ 

0(x)  ■  1  -*  lh*#(i(x)l  H  h^#a(x0). 

(C25) 

Claim  Cl 

For  a  *  0, ....  T  -  2, 

(-l)a<*«  y*.(7W)>>0.  (C26) 


Proof  of  Claim 

First,  we  will  prove  the  claim  for  a  *  0.  Since  by  Eqe.  (C21  )- 
(C*3),  #(«*  -  xo) m  1,  it  follows  that  a*  -xo  E  O.  Therefore, 
by  lemma  A3, 

(a*  -  Aoty*e(r-i)>  <  (My/keir-n)'  (C27) 
or,  since  «*  *  q*  +  qui. 

<*o  “  ffsaiiytair-u)  >  0*  (C28) 

Thus  the  claim  is  true  for  a  ■  0.  Now  let  1  £  a  £  T  -  2  and 
assume  the  claim  is  true  for  a  —  1.  By  Eqs.  (C2l)-(C23), 
<£'«**„  -  *«)  "  1,  and  hence,  by  using  implication  (C25), 

(l**(o-n(a*#B  —  l(*iieia-i)(tl(^ea  —  **«)! 


or,  equivalently, 

(C29) 

<x^_ 

(C3C) 

Similarly,  J>(.r0-i) 

*  1,  and 
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~^kta^xa-0  ^  ^*eai'*a-l)l 

~  h^Ba(xa) 

*  ~^*ea!a*en  ■Tali 

(C31) 

and  therefore 

^Aeai^Aea  —  ^  hk#fl(xa_|), 

(C32) 

or,  equivalently, 

(*a-i  —  ak*t  "b  x„,  ynj0)  i  0. 

(C33) 

By  lemma  C2,  0  £  int(y*«ia-».  y*« «,  (-D^yucr-nl,  and 
therefore  there  exist  .strictly  positive  real  numbers  <ri,  era, 
such  that 


tfty*e<o-n  H y*a«  c'3("l)7'"’ay*a(r-i)  ■  0-  (C34) 

Since,  by  lemma  A2,  T  is  odd,  (— l)r-a  «•  -(-i)#,  and  from 
Eq.  (C34)  we  obtain 

(C35) 

By  Eq.  (C36)  and  expressions  (C30)  and  (C33), 

-  «*•.  +  *„<  3>k«r-  d> 

"  ^  *a' 

+  *«(*a-l“«*a.  +  *aly*aa> 

2S  0,  (C36) 

and  since  crj  >  0,  k 

<-l)a<*u-t  ~  y*#(T-n>  a  o,  (C37) 

By  substituting  <*«•„  ■  <?*«.  +  q*,a#l  and  using  expression 
(C37)  and  the  induction  hypothesis,  we  obtain 

(-l)°(xa  —  y*«,T-n> 

a  —(— i)a(x0_1  ~  y**\r-'.)} 

-  (-I)0"1  y*.,r-D> 

>  o.  (C38) 

This  completes  the  proof  of  the  claim. 

Set  a  -  T  -  2  in  expression  (C26).  Since,  by  lemma  A2,  T 
-  2  is  odd,  we  obtain 

y*eiT-i>)  <  i</*e(T-i)>>,Aair-u)'  (C39) 

It  now  follows  from  lemma  A3  that  xr-i  4  D,  and  therefore 

■  0,  which  contradicts  either  Eq.  (C21)  or  Eq.  (C22). 
Therefore  n  cannot  fail  to  be  incremented  on  each  of  T  -  1 
consecutive  passes  through  the  loop.  This  completes  the 
proof  that  the  loop  is  not  infinite. 

It  now  follows  that  the  program  produces  sequences 
qn  ■  ■  • ,  C/v-i  and  mr, ....  ntiv- 1.  It  remains  to  be  shown 

that  if  q  -  too, .  . ,  qn-i)  and  m  ■  (mr . m/v-t),  then 

(q,  m)  is  a  reconstruction  algorithm  for  the  mas.:  Af. 

We  have  R(M)  -  too . qr-il*  Let  T  3  n  3  N  -  l,  and 

let  all  variables  have  the  values  that  they  have  after  step  5 
and  before  Btep  6  of  the  pass  through  loop  in  which  qn  and  m„ 
are  defined.  By  the  definition  of  b  in  step  1  of  the  loop,  for  x 

ezs 

lA*(d*,6)l  <  \hk(x)\  m*  0(x)  -  0.  (C40) 
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If  x  £  D,  then  before  entering  the  loop  0  had  to  have  the 
value  1  at  x,  Thus,  if  the  current  value  is  0(x)  ■  0,  0  must 
have  acquired  the  value  0  at  x  on  some  preceding  pass;  that 
is,  x  ■  q„'  for  some  n'  <  n.  Therefore  x  6  too,  • . . ,  (fa-il- 
Thus,  for  x  £  D, 

0(x)  -  0  —  x  £  too . (C4I) 

First,  it  will  be  shown  that 

M  n  (Af  +  qn  -  qm)  c  too . <?„!•  (C42) 

Let  x  £  Af  n  (Af  +  qn  ~  qm„ )•  We  want  to  show  that  x  £ 

|<?o . <?n|,  If  x  £  R(M),  then,  since  n  2  T,  we  are  done, 

Since  qn  ■  dklb,  ifx  ■  d^.s,  we  are  done.  Now  assume  that  x  4 
R(M)  and  x  d  d*,e.  Since  x  4  R(M),  x  £  D. 

Claim  C2 

l**Ww)l  <  lh*(x)|.  (C43) 

Proof  of  Claim 

The  proof  of  the  claim  will  be  divided  into  two  cases. 

Case  1 

A*(d*l4U  0,  (C44) 

Since  qn  ■  <!*,*,  it  follows  from  step  5  of  the  loop  that  m„  ■ 
k>  Let  *  ■  x  -  d*,s  +  qk>  Since  x  £  Af  +  dk,b  -  q*,  it  follows 
that  x  £  A#.  Also,  since  x  d  </*,(,,  t  d  qk.  Therefore,  by 
lemma  A3, 

-  <*,y*>  “  (045) 

or  (dn.t,  yi,)  <  (x,  y*).  It  follows  that  A*(<f*,i)  <  h*(x),  and 
since  %  0,  the  claim  follows. 

Case  2 

4)  <  0.  (C46) 

In  this  case  m«  ■  A  ®  1.  Let  x  «  x  -  d*,*  +  <7**1.  Since  x 

6  Af  +  dk,b  ~  9*«i,  it  follows  that  z  £  Af.  Also,  since  x  d 
dk,b,  x  d  Therefore,  by  lemma  A3, 

(x,y*>  -  <d*lfc,y*)  +  <<7*#i,y*>  ■  <r,y*> 

<  <9»#i.y*>.  (C47) 

or  <x,  y*)  <  (dk,b,  y*>.  It  follows  that  h*(x)  <  h^d^t,),  and 
since  hi,(dk,b)  <  0,  the  claim  follows.  This  completes  the 
proof  of  the  claim. 

It  follows  by  Implication  (C40)  that  0(x)  ■  0,  and  since 
x  £  D,  it  follows  from  implication  (C41)  that  x  £  too, . . . , 

4n-i!  s  too . <?nl>  This  completes  the  proof  of  relation 

(C42). 

It  remains  to  be  shown  that 

Af  n  (Af  -  dh<b  +  qmJ  a  too, . %-il  (C48) 

Let  x  £  Af  n  (Af  ~  dk,b  +  We  want  to  show  that 
x  £  too,  •  •  • ,  4n-i),  Since  n  Z  T,  If  x  £  R(M),  we  are  done, 
Assume  that  x  4  fl(Af);  then  x  £  £>.  The  proof  will  be 
divided  into  two  cases. 


244 


134  J.  Opt,  Sue,  Am,  A/Vol.  4  No.  1/January  1987 


Thomai  R,  Crlmmlna 


Case  1 

Md) w)  *  0. 

In  this  case  m„  -  k,  Let  z  ■  x  +  d*,/,  -  <j*.  Since  xgM- 
do  +  <ik<  It  follows  that  i  g  Af, 

If  r  -  then  x  -  <j*  +  v**i  -  d*,s  ■  o*  -  d*,*.  If 
<Mx)  ■  1,  them  by  step  2  of  the  loop,  </„  would  not  have  been 
defined  on  this  pass,  contrary  to  the  assumption  that  it  was. 
Therefore  <t>{x)  ■  0,  and  by  implication  ;C41),  .x  g  {<7o . 

Assume  that  x  f*  then,  by  lemma  AS, 

U.y»>  +  (dkth,yk)  -  (qk,yk)  -  <*,y*> 

<<  fleet .y*>.  (C49) 

Recalling  that  dt  ■  Ids  +  flee  t)/2,  it  follows  from  expression 
(C49)  that 

+  (C50) 

or  /te(de.e)  <  ~Mx),  arid  since  Asides)  £  0,  it  follows  that 
l/ie(de,e)l  <  lhe(x)l.  Now  by  implication  (C4C),  4U)  ■  0,  and 
by  implication  (C41),  x  g  |<)o, . . . ,  g«-t|. 

Casa  2 

he(de,e)  K  0. 

In  this  case  m,  ■  it  •  1,  Let  z  ■  x  +  de,e  -  qeei-  Since 
x  g  W  -  d/,,1,  +  i/ktti  It  follows  thatx  g  M. 

If  x  ■  9*,  then  *  -  <?e  4-  qeei  “  de,e  ■  a*  -  de,e,  and  by  the 
argument  given  in  case  1  above,  0(x)  “  0,  and  by  implication 
(C41),  X  g  |<7o<  •  •  *  I  flit-ll- 
Assume  that  t  <*  Qei  then,  by  lemma  A3, 

■  +  <<W*>  -  (<jk9l,yk),  (C51> 


from  which  It  follows  that  -hi,(x)  <  /i*(d*,s).  Since  hk(dk,b) 
<  0,  it  follows  that  lMd*,(>)|  <  lh*(x)l,  Hence,  by  implication 

(C40),  0(x)  ■  0,  and  by  Implication  (C41),x  g  |<?0 . 

This  establishes  relation  (C48)  and  completes  the  proof  that 
(q,  m)  is  a  reconstruction  algorithm  for  the  mask  Af. 
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„  Appendix  E 
Ves*(g(x) )FOR  COMPLEX  OBJECTS 

In  this  appendix  we  generalize  the  expression  for  the  gradient  of 
the  summed  objective  function  to  Include  objects  and  object  estimates 
that  are  complex  valued. 

Recall  that  the  summed  objective  function  Is  defined  as  the  sum  of 
a  generalized  object-domain  error  metric  and  the  Fourler-domaln  error 
metric: 

•,2  ■  eo2  +  *F2*  (E-l  ) 

where 

‘  £  |9(*)|2.  (E-2 ) 

xeS' 

and 


•f2  *  N‘2E[l6(“)l  -  |F<U)I]Z.  (E-3) 

u 

Notice  the  summed  objective  function  Is  Implicitly  a  function  of  the 
real  and  Imaginary  parts  of  the  pixel  values  for  the  latest  estimate, 

g(x).  We  therefore  treat  the  real  and  imaginary  parts  of  each  pixel  as 

2 

distinct  parameters  that  can  be  adjusted  In  order  to  minimize  e$  .  We 
express  the  real  and  Imaginary  parts  of  the  latest  estimate  as 
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g(x)  ■  a(x)  +  1b(x). 


( E-4 ) 


We  define  the  gradients  to  be: 


Ve2(g(x))  -  2 
8  j-1 


3a(xj) 


3b(Xj) 


(E-5 ) 


where  VjR  and  Vj1  are  orthogonal  unit  vectors  associated  with  the  real 
and  Imaginary  parts  of  the  pixel.  The  first  partial  derivative  In 
Eq.  (E-5)  may  be  separated  Into  two  terms: 


3e$2  3ep2  3e02 
3a ( x j )  "  3a(Xj)  +  3a(Xj) 

For  the  moment  we  examine  the  first  term  In  (E-6): 


It  Is  easy  to  see  that 


N'2  £  [ lG(u)  I  -  lF(u)l]2 

U 

( I G<u) |  -  | F(u) | )  • 

J 


iliiM 

3aTx 


j 


1  3 1 G(u) 1 2 

'2TWi«txjr 


(E-6) 


(E-7) 

(E-8) 


,  f  12ttii*Xj/N 
“  2T'fi'(uTf  lG(u5  e 


(E-9) 
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Substituting  Eq.  (E-9)  Into  Eq.  (E-8)  yields 
ge  2  1  2ttu  •  x*  /N 

83TTT  ‘  N'2  I  '  IK“>I>  |slu)  *C'C'1 

J  u 

■  (g(xj)  -  g'(xj))  +  (g*(xj)  -  g'*(xj)) 

•  2(«{Xj)  -  a'(xj)). 

This  result  Is  consistent  with  the  result  quoted  for  real, 
objects.  A  parallel  derivation  gives 


iSTxjT  •  2(b(xj)  -  b'(xj))  . 


Me  now  return  to  the  second  terra  In  Eq.  (E-6): 

l  i9<*>i2 

J  J  XgS  ' 


Similarly, 


rilin'  Z  j2(x)  +  b2(x) 

J  *eS ' 


2a (Xj }  ,  Xj  e  S' 


•  *j  e  s  . 


3V 

mqrm\ 


(2b<Xj)  ,  Xj  e  S' 


,  Xj  e  S 


(El-10) 


valued 


( E-l 3 ) 


(E-14) 


(E-l 5) 
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Let 


S*  ( x) 


li  x  e  S' 
0,  x  c  S 


Now  Eqs.  (E-14)  and  (E-1S)  may  bt  expressed  more  conveniently: 

H* 

3irxp-*2*(xj)s'txj) 


and 


8*o . ■  ZbU^S'UJ  . 

oBTxJT  J  J 


Collecting  these  results,  m  have: 


351^7  -  2ta(Xj)  -  «’(*j)3  +  2*(Xj)S‘(Xj) 

3‘,2 

WtJt-  *  »(Xj)S'(Xj) 


Equations  ( E-19)  and  < E-20)  may  be  combined  to  form  a  complex 
Image. 
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L.MJLTI  Ji-TMJ’IJ 


( E- 1 6) 


(E-17) 


(E-18) 


(E-19) 


(E-20) 


gradient 


Gradient  Image  =  2[g(x)  -  g'(x)]  +  2g(x)S'(x)  (E-21) 

The  extension  to  complex-valued  objects  still  requires  only  2  FFTs  to 
compute  the  gradient. 


„  Appendix  F 
Ve0Z(fl(x) )  FOR  REAL  objects 


Recall  that  the  object-domain  error  metric  Is  Implicitly  a  function 
of  the  Input  estimate  g(x)  and  is  given  by: 


•oZ(0(K»-  XJCB'(x)]2.  <M) 

XtS 

Its  gradient  with  respect  to  the  Input  pixel  values  may  be  written: 

nJ  3,2 

L  aa?*TT  VJ  <F'2> 

j-1  J 

where  Vj  Is  a  unit  vector  In  the  direction  of  the  parameter  g(Xj)  In 
parameter  space.  We  focus  now  on  the  partial  derivative  that  appears  In 
Eq.  (F-2) : 


Sfl 

3g(xj ) 


2  Z  s'(x)  i?CT!,W  • 

xeS'  J 


( F-3) 


Substitution  of  the  Fourler-domaln  expression  for  g'ix)  gives: 

/ 


se 


2 


3g(xj)  "  2  Z  9'^  agTxJ) 
J  xeS'  J 


N*2  £  G'(u)  e12lTU**/N 


;f-4) 


Recall  that 


I 


giving 


5g?*TT“  2  I  9'<*>  SgUTT'"'2  £ 

w  vpC1  «  n 


■!iw 

u 


JEiru'X/N 


■  2N"2  Yt  9‘ (x)  Y  |F(u)|e12mi*x/N  ^2— • 


(  F-6) 


In  order  to  evaluate  th*  partial  darlvatlva  In  Eq*  (F-6)  w«  naad 


axp rati Ions  for  and  ; 

J  J 

HlSjr  ‘  snip- 1 s(i<)  •* 


12iru*x/N 


•  l  ••i2’u'x/N  KBjr 


-12iru-Xj/N 


Tho  darlvatlon  for  requires  tha  result  in  Eq.  (F-7); 

J 


•  fiw  3g^jr|a(u)l' 


■  JITBlW  [a*(u)  *  c,c'] 

,  (*  -1 2nu  •  x  j /N 

■  JR5TT  [a*(u)  4  *  C 


L__ 


(F-7) 


(F-8) 


where  C.C.  stands  for  complex  conjugate  of  the  explicit  term.  Using  the 
results  In  (F-7)  and  (F-8)  and  with  some  algebraic  manipulation  the 
partial  derivative  In  (F-6)  becomes 


a  G(u 

agTxjJ  TO 


-12nu*Xj/N 


-  C.C. 


(F-9) 


Substituting  Eq.  (F-9)  back  Into  Eq.  (F-6)  yields 


'127tu*x4/N 


asTiCT  *  N"2  E  s'U)  £  |F(u)|  G*(Ul>  CliriiyTTO~ 


CJ  e12itu*x/N 
(F-10) 


By  changing  the  order  of  summation  we  have 


SgTxJT  «  N* 


lfr(u)l  G 

11 


12ru-Xj/N 

ruT!TO~ 


‘‘‘ll 

J  XpS1 


g'(x)  e12lTU'x/N  . 
(Ml ) 


At  this  point  It  Is  convenient  to  define  the  characteristic  function  of 
the  complement  of  the  support  S  as  follows 


S'(x) 


1  »  x  e  S' 


0  •  x  e  S 


(F-12) 


The  second  summation  In  (F-ll)  may  now  be  rewritten 


9 ' (x)  e127TU,x/N  „  S'(x)g'(x)  e1ZlTU,x^N 


(F-  13) 


The  error  In  the  output,  g#(x),  consists  of  that  component  of  the  output 
that  violates  the  support  constraint: 


9e(x)  -  S'(x)g'(x) 


(F- 14) 


The  summation  In  Eq.  ( F- 13)  has  the  form  of  a  forward  DFT: 


e12uu  *x/N 


i 


g.U) 

© 


“  ae(-u) 


( F- 1 5 ) 


where  G#(u)  Is  the  DFT  of  g#{x).  The  total  partial  derivative  may  now 
be  written: 


2 

3eo 

agTxJT 


Z 


|F(u)|6  (-u)f  -12ttU'Xj/N 

jmmn  G*(u)  8 


c.c. 


u.2  ^  |F(U)|G  (-u)  -127TU •  x^/N  m7  ^  |F(u)|Ga(-u)G(u)  I2rru* 

■N  L  ~ TW  6  - N  L  | S(u) 1 6*(u) - • 

"  U  (F-16)  ' 

Using  Eq.  (F-S)  In  the  second  term: 

3eo  -2  r*  I F(u) |Ge(-u)  -127TU*x1/N  «  G'(u)G  (-u)  12iru*x./N 

39TSTT  ■  N  L  — [etuTI —  *  -N  I— w~!  3 

J  u  u  (P-17) 


Remarkably  both  of  these  terms  have  the  general  form  of  DFTs.  In  order 
to  combine  both  of  these  terms  Into  a  single  Inverse  DFT  m  perform  a 
sign  change  of  variable  on  the  Fourier  vector  In  the  first  term.  The 
net  result  is: 


3*5 

5gfx 


■J* 


|F(-u) |G0(u) 
| 6( -u) ] 


Q'(u)Ge(-u)“ 


12nu*x./U 
e  J 


(F-18) 


Finally,  by  appealing  to  the  Hermltlan  property  for  Fourier  transforms 
of  real  functions  we  may  make  the  following  substitutions: 


I  F(-U')  |  •  |  F(u)  | 
|o(-u)|  .  |e(u)| 


G,(-u) 


Ge*{u) 


(  F-19) 


to  got  tht  final  result: 


|F(u)|Ge(u)  G'(u)Ge*(u) 

. ttoi - to~ 


12iru*Xj/N 


(  F«20) 


.  Appendix  G 
VeQz(g(x) )  FOR  COMPLEX  OBJECTS 

The  derivation  of  the  gradient  of  the  eQ^(g(x))  objective  function 
for  the  case  of  complex  objects  closely  follows  that  for  real  objects 
(Appendix  F).  Therefore,  we  just  highlight  this  derivation,  calling 
attention  to  significant  differences.  We  begin  by  acknowledging  that 
the  complex  case  admits  twice  as  many  parameters;  namely  the  real  and 
Imaginary  parts  of  each  input  pixel.  We  denote  the  real  and  Imaginary 
parts  of  the  Input  function  as  follows: 

g(x)  -  a(x)  +  1  b(x).  (G-l) 

We  define  the  gradient  to  be: 


where  v^  and  Vj1  are  orthogonal  unit  vectors  associated  with  the 
parameters  of  the  real  and  Imaginary  parts  of  the  pixel  at  location  Xj. 
The  partial  derivative  of  the  objective  function  with  respect  to  the 
real  part  of  a  pixel  value  may  be  written 


The  extra  complex-conjugate  term  appears  because  the  output  functlo* 
g'(x),  can  assume  complex  values.  The  partial  derivative  with  respec 
to  the  Imaginary  part  Is  similarly  found: 


5BTET-  k2  E  9'*(*)  Y  I F(u) | 
J  L  xeS'  U 


.12m  *x/N  a  (  G 


rojTjjG 


Sll 


+  c.c. 

( G-4) 

With  simple  algebraic  manipulations  the  following  useful  identities  may 
be  verified: 


(0-5) 


3b(Xj7  16 


12m*Xj/N 


3  I  G< 


3a(Xj) 

ziw 

8|S(u)| 

1 

"  2TOTJ1 

the  aid 

of  Eqs.  (G 

3  1 

|  G(uj  |  „ 

3  a  (Xj) 

(TTOT  i 

-12m*x^/N 


-i<.iiU*x,/H 
i)  e  J 


+  c'c] 

-  c.c.l 


-12ttu  •  Xj /N 


tlTOTBW 


-  c.c, 


(6-6) 

(6-7) 

(G-6) 


(Q-9) 
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a 


and 


127W*x,/N 

J  -  C.C 


1G*(u)  e 

—  ^TSruTl^TuT 


(Q- 10) 


When  Eq.  (G-9)  Is  substituted  into  Eq.  ( 6-3)  and  tha  o»der  of  summation 
Is  Interchanged  we  get: 


38o 
aaixj ) 


N-2 

T 


-12t(u«Xj/N 


-  16H.U  * 

I  i  s'(»)o'*(x)  .,2,u- 


x/N 


+  C.C. 


(G-n) 

were  S'(x)  is  the  characteristic  function  of  the  set  S',  as  before. 
Recall  the  object  and  Fourler-domaln  expressions  for  the  error  Image: 


gt(x)  ■  S'(x)g'(x)  (g- 1 2 ) 

6f(u)  “  ]T  S'(x)g'(x)e“12TrU*x/N  •  (q.  13) 

x 

We  may  therefore  write 

G#*(u)  *  £  S'(x)g'*(x)e12lru'x/N  (G-14) 

x 


which  may  be  substituted  Into  (G-U)  to  get: 


3a(Xj) 


‘  ,  t  -12ttu*x./N  i 

n-2  ^  |F(u)|.Ge*(u)  jG*(u)  e  J  -  C.C.} 

T  L  - LWu]'TS'*(u) - 1 

u 


+  C.C.  (g-  15) 
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By  similar  steps  we  find 


o  r  i  -12itU,Xj/N  \  M 

3e2  L-2  _  |F(u)|6  *(u)-tlG*(u)  e  1  -  C.C.} 

aETxJT  "  [t-  2- - - ~[5fu)|6*(u) - J  *  c'c’ 


Explicitly  writing  out  the  complex-conjugate  terms  In  Eqs.  (6-15)  and 
(G- 16)  and  performing  a  few  additional  manipulations  we  produce  the 
final  result. 


38o 

3a(Xj) 


mp 


jf(.u)|Ga»(-u) 

— [SF3TT~ 


|F(-uHse*(-u) 

— r<sr-u>i  + 


G'(u)Gfi*(u)  12iru.Xj/N 
~'fi*(u) -  e 


G'(u)G*(u)1  I2rru-X,/N 

5'*(u')  J  e  J  ' 


(G-17) 


( G-18) 


It  Is  gratifying  that  Eq.  (6-17)  Is  consistent  with  Eq.  (F-18)  which  Is 
the  equivalent  partial  derivative  for  real  objects  only.  It  Is  worth 
mentioning  that  because  the  summation  arguments  In  Eqs.  (G-17)  and 
(G-18)  differ,  an  additional  FFT  Is  required  In  the  computation  of  the 
gradient  for  complex  objects.  The  total  number  of  FFTs  (forward  or 
Inverse)  needed  Is  Increased  to  five. 


i 

I 
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Appendix  H 


DATA  PROCESSING  IN  THE  PHASE  RETRIEVAL  LABORATORY 

The  portable  laboratory  data  acquisition  and  analysis  system  can 
acquire  Image  data  via  a  Fairchild  CCD3Q00  camera;  digitize,  Integrate, 
and  process  512  by  512  Image  data  via  Imaging  Technology  video  hardware; 
and  execute  algorithms  requiring  FFT  and  other  computationally  Intensive 
operations  (such  as  the  Iterative  Fourier  transform  algorithm  for  phase- 
retrieval)  using  a  Mercury  ZIP  3232  16  Mflop  array  processor.  The  host 
computer  Is  a  Neurlkon  68000-based  system  using  the  UNIX  operating 
system.  An  outline  of  the  functions  of  the  experiment  control  software 
Is  given  below. 

1.  Data  acquisition 

Digitize:  Digitize  and  store  Image  In  Imaging  Technology,  Inc, 
(ITI)  frame  buffer  with  correction  for  calibrated  nonuniformities. 

Integrate:  Digitize  n  Images  and  sum  In  array  processor  (AP) 
with/without  normalization. 

2.  Image  display 

Display:  Display  AP  and  hard  disk  Images  on  ITI. 

Notes:  (1)  Conversion  from  32  bit  to  8  bit  data  If  desired 
(2)  Many  options: 

Display  real,  Imaginary,  magnitude,  magnitude- 
squared,  or  phase 

Apply  bias  and  scale  (as  In  y»ax+b) 

Display  absolute  value 

Magnify  by  2,4,8,...  (specify  subimage  to  be 
displayed) 

Sample  to  give  256x256  Image  and  display  In  specified 
quadrant  of  ITI  display  (allows  four  Images  to  be 
displayed  simultaneously  for  comparison) 

Display  any  size  Image  In  any  location  of  display 
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(3)  Values  above  and  below  the  8  bit  range  of  the  ITI 
are  clipped  at  0  and  255 

Llve/Memory:  Toggle  between  displaying  video  Incoming  to  ITI  and 
data  In  ITI  frame  buffer. 

3.  Image  algebra  (all  In  AP) 

Adds  Add  two  Images. 

Subtract:  Subtract  two  Images. 

Multiply:  Multiply  two  Images. 

Divide:  Divide  two  Images  with  user  definable  result  for  divide 
by  zero. 

Scale:  Add  bias  and  scale  Image. 

Threshold:  Hard  limit  above  and  below, 

Logic  operations  between  binary  Images. 

Magnitude:  Find  magnitude  or  magnitude-squared  of  an  Image. 
Phase:  Find  phase  of  an  Image. 

Convert:  Change  real  Image  to/from  complex  Image. 

Print:  Print  values  of  specified  small  part  of  an  Image. 
Statistics:  Find  mean,  variance  of  Image  and  magnitude-squared 
of  an  Image. 

Maxmln:  Find  max  and  min  values  of  Image. 

Histogram:  Compute  histogram  of  Image  and  display  on  ITI. 
Convolve:  Convolve  Image  with  a  small  specified  convolution 
kernal  (allows  smoothing  and  other  operations  on  data). 
Interpolation:  Interpolate  from  one  sample  spacing  to  another. 

4.  Create  Images  (In  AP)  for  test  purposes 

Zero:  Zero  fill  an  Image. 

Create:  Place  a  rectangular,  circular,  or  triangular  region  of 
specified  complex  value  at  a  specified  position  In  an  Image. 
Aperture:  Multiply  Image  by  a  binary  rectangular,  circular,  or 
triangular  aperture  located  at  a  specified  position. 


Noise:  Add  zero-mean  Gaussian  noise  with  specified  variance  to 
an  Image  (specifying  seed  so  that  same  or  different  set  of 
random  numbers  can  be  generated)  (Also  uniform  and  Poisson 
nol se) . 

5.  Iterative  algorithm 

Setup:  Allocate  and  load  image  domain,  Fourier  domain,  Image 
domain  constraint,  Fourier  magnitude  constraint,  and  buffer 
arrays  In  AP. 

Iterate:  Iterate  n  times  using  specified  form  of  Iterative 
algorithm,  computing  and  printing  error  measures. 

Display:  Display  Intermediate  results. 

Save:  Save  results. 

6.  Image  error  computation 

Error  measure:  Compute  normalized  root-mean-squared  error  of 
complex  image  or  of  image  magnitude  relative  to  reference 
object,  taking  Into  account  Intensity  scaling  and  (for  complex 
Images)  constant  phase  shift. 
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Appendix  I 

STOCHASTIC  VS  DETERMINISTIC  APPROACHES  TO  PHASE  RETRIEVAL 
1.1  INTRODUCTION 

When  a  !>AR  system  operates  under  Ideal  conditions,  the  phase 
variation  occurring  from  one  received  echo  to  the  next  Is  known.  This 
knowledge  Is  employed  In  the  receiver  by  building  a  "matched  filter"— 
matched  to  the  amplitude  and  phase  characteristics  of  the  train  of 
echoes  that  are  processed  during  the  coherent  Integration  Interval. 

In  some  circumstances  the  phase  variation  from  echo  to  echo  Is  not 
completely  known.  This  may  occur,  for  example,  due  to  uncompensated 
platform  motion  effects,  unknown  target  accelerations,  atmospheric 
effects,  etc.  In  these  cases,  the  lack  of  complete  phase  Information 
can  be  modelled  by  considering  the  Ideal  situation  (no  phase  error)  to 
be  perturbed  by  the  presence  of  phase  errors,  en(t),  occurring  on  each 
(nth)  echo. 

One  approach  for  reducing  the  deleterious  effects  of  uncompensated 
phase  errors  on  the  SAR  Imagery  Is  to  remove  the  phase  In  the  recorded 
(pre-processed  data),  preserve  the  magnitude,  and  attach  a  pre-selected 
phase  history  using  a  priori  Information  about  the  shape  of  objects  In 
the  final  Image.  In  this  memo  this  approach  Is  referred  to  as  the 
deterministic  approach  to  phase  retrieval  to  distinguish  It  from  another 
approach  described  next. 

Usually  the  phase  perturbations  arise  from  a  physical  mechanism  that 
can  be  modelled  as  a  slowly  varying  (1 . e. ,  from  pulse-to-pulse) 
stochastic  process.  If  the  statistics  of  that  process  are  known,  then 
the  phase  perturbations  can  be  compensated  out.  To  account  for  spatial 
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variations  In  the  statistical  model,  however,  It  Is  necessary  to  regard 
the  perturbation  statistics  as  only  partially  known. 


In  the  alternative  phase  retrieval  scheme  suggested  here  the  phase 
errors  are  regarded  as  a  zero  mean  Gaussian  random  process  whose 
covariance  function  Is  known  up  to  a  finite  number  of  unknown 
parameters.  These  unknown  parameters  are  estimated  during  the  actual 
system  operation,  using  the  received  (phase-corrupted)  data  along  with  a 
priori  knowledge  about  the  dynamics  of  the  physical  process  that 
actually  caused  the  perturbations.  Thus,  the  random  process  approach  to 
phase  retrieval  exploits  a  different  type  of  a  priori  Information  than 
the  deterministic  approach,, 

Depending  upon  the  system  being  modelled  and  the  scene  being  Imaged, 
It  Is  likely  that  these  alternative  phase  retrieval  methods  will 
demonstrate  different  degrees  of  Image  enhancement.  The  deterministic 
approach  should  perform  best  when  the  variety  of  Images  being  sought  Is 
limited  to  a  sufficiently  small  set  that  each  possible  attached  phase 
function  can  be  considered.  The  random  process  approach  ought  to  be 
favored  when  the  scene  Is  highly  variable  but  where  the  physical  causes 
of  phase  perturbation,  Itself,  can  be  modelled  as  a  (low  dimensional) 
parameter  estimation  problem. 

Below  we  outline  the  technical  basis  of  stochastic  phase  retrieval. 
We  begin  by  showing  how  the  actual  observables  furnished  by  the  sensor 
are  affected  by  random  phase  errors.  We  then  show  that  If  the 
covariance  function  of  the  phase  errors  were  known,  then  a  complete 
characterization  of  the  desired  Image  could  be  formulated,  just  as  In 
the  deterministic  case.  It  Is  then  seen  that  the  deterministic  phase 
retrieval  method  attempts  to  estimate  the  proper  phase  for  the  actual 
waveform  received  from  a  target  scene;  while  the  stochastic  approach 
attempts  to  estimate  the  proper  phase  of  the  waveform's  covariance 
function. 


268 


l.J 


1.2  TECHNICAL  BASIS 


If  en(t)  has  appreciable  high-frequency  energy,  (l.e.,  If  It  changes 
rapidly  In  time)  then  even  as  one  range  sweep  (an  echo)  arrives  at  the 
radar,  we  must  account  for  the  phase  variation  that  occurs.  It  can  be 
shown  that  In  this  case,  the  result  of  employing  the  processing 
described,  e.g.  In  Brown  [1.1]  is  to  produce  the  "observables"  on  one 
(the  nth)  ech0  (l.e,,  one  range  sweep): 

Q(w,un)  -  /  Z(ro)  eJ^ndo)  Iedo.«)  dV(r0)  (M) 


where 

■  J  [w[r  +  (Iq) 3  exp[jen(r)]]  e"Jwr  dr 
r 


and  where: 

r0  denotes  a  vector  drawn  from  the  origin  of  a  coordinate  system 
centered  somewhere  on  the  target  (scene)  to  each  element  of  volume, 
dV(r0),  contributing  to  the  range  sweep, 

R  defines  the  region  of  space  that  contributes  to  the  range  sweep. 
(Normally  R  is  determined  by  the  receive  system's  antenna  gain). 

1s  a  scalar  variable  that  depends  on  the  vector  r0  defined 
above  and  upon  the  unit  vector,  un,  defined  along  the  radar's 
Hne-of-slte  during  the  round  trip  path  traversed  by  a  single 
transmitted  pulse  and  its  echo.  (Platform/target  motion  Is 
assumed  "frozen"  during  the  time  required  for  the  transmitted 
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pulse  to  traverse  the  Interval  of  range  contained  In  the  range 
sweep) . 

W(r)  $  ApLEjQxp|j^[££j  j  I3  the  complex  envelope  of  the  transmitted 

pulse,  expressed  as  a  function  of  r  *  ct/2. 

«n(r)  Is  the  phase  error  occurring  on  the  n^  echo. 

Let  us  examine  Ia(r0,w)  and  show  how  our  expression  In  (1-1)  relates  to 
Brown's  corresponding  result.  (We  will  see  that  we  obtain  Brown's 
result  If  i  i  0,  which  corresponds  to  the  Ideal  case  of  no  phase 
errors.) 

Using  Brown's  notation  define 

« 

Y(«)  “  J  W(r)  e"^wr  dr  ,  (1-2) 

Also,  let 

h^(r)  i  exp[jen(r)] 
and 

« 

H^(tf)  »  j  h^(r)  e”^wr  dr  .  ( 1-3) 

-* 


It  then  follows  that 


1,(10,  ")  ■  {e+J*"  ¥(«)}  *  {h'J1 («)}  (1-4) 

where  the  symbol  *  denotes  convolution. 

In  the  case  where  en(t)  s  o,  h€(n)(r)  ■  1  and  Hg(n)(w)  «  ff(w),  the 
Dirac  delta  distribution.  Thus,  the  result  of  the  convolution  Is  In 
this  case: 

I0(lo.  «)  ■  e+jw'  Y(«)  .  (1-5) 

Inserting  (1-5)  Into  (1-1)  produces  Brown's  result  [his  Eq.  (6) ] : 

Q0(«.  Mn)  ■  Y(w)  j  of^)  exp  {j(k  +  u)  un  •  rj  dV^)  (1-6) 

As  described  in  Brown,  the  next  step  Is  normally  to  employ  a  filter 

matched  to  Y (w).  This  filtering  accomplishes  the  pulse  compression  in 
range  and  one  then  has 

V  ■  T(«)  J  Sf^)  exp  {j(k  +  u)  un  •  rj  dV^)  (1-7) 

where  the  *  denotes  the  output  of  the  range  compression  filter  and 

T(w)  «  K(w)  Y (w) . 

From  the  above  It  follows  that  when  phase  errors  are  present,  the 
desired  Fourier  transform  relationship  between  a(x,y)  and  Q(«,un)  Is  not 
generally  true,  This  Is  significant  In  the  Interpretation  of  the  phase 
retrieval  problem  because  It  is  not  accurate  to  regard  the  phase 
retrieval  problem  simply  as  a  problem  In  Inverting  a  Fourier  transform 
In  all  cases. 
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It  true,  however,  that  we  may  still  regard  the  observables  as 
the  Fourier  Transform  of  the  desired  scene  If  en(r)  Is  assumed  to  be 
constant  during  any  one  range  sweep.  This  means  c(, (r)  is  not  a  function 
of  r  at  all.  Following  a  procedure  just  like  the  one  used  above,  It 
then  follows  that  for  phase  errors  that  vary  only  from  pulse-to-pulses 

§(«.  un)  -  exp(jen)  50(«,  un)  (1-8) 

where  Qo(w,un)  are  the  error-free  observables.  Hence,  In  this  case  the 
observables,  Q(w,un),  are  a  version  of  the  error-free  case  that  has  been 
rotated  In  phase  by  the  phase  error  en.  This  rotation  (by  different 
values  of  en)  occurs  on  each  received  range  sweep  In  our  model. 


The  key  observation  at  this  point  Is  that  under  the  most  general 
conditions  we  must  Impose  some  constraint  on  the  sequence  of  values 
(ei,  t2,  ...,  en*  •••  *n)  froni  N  echoes  If  there  Is  any  hope  of  forming 
a  SAR  Image.  If  we  assume  that  all  the  en  are  random  variables, 
mutually  Independent  and  uniformly  distributed  over  [0,  2jt],  then  It  Is 
not  possible  to  "coherently"  sum  the  N  azimuth  echoes  except  In  the 
Impractical  situation  of  trying  all  possible  combinations  of  en  that 
could  Jointly  occur  over  the  N  echoes.  [If  each  en  Is  divided  Into  8 
equal  values  (discretization  of  phase)  there  would  be  (8)(N-1)  phase 
combinations  to  try.  Even  If  only  10  echoes  were  processed  for  the  SAR 
resolution  desired,  exhaustive  search  of  all  these  possibilities 
requires  more  than  108  combinations  of  phase  to  be  tested].  With  the 
deterministic  approach  to  phase  retrieval,  the  constraints  that  are 
Imposed  on  the  {e -f }  arise  from  assumed  knowledge  of  target  shape.  The 
stochastic  approach  to  constraining  the  {e-j }  employs  knowledge  of  the 
correlation  function  of  the  random  errors  In  order  to  compensate  for 
them. 
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A  view  held  by  some  Investigators  of  the  phase  retrieval  problem  Is 
that  the  correlation  between  the  errors  might  turn  out  to  be  essentially 
zero.  This  will  depend  upon  tho  causative  mechanism  and  upon  various 
radar  parameters.  Admittedly  If  the  errors  are  uncorrelated,  then  the 
stochastic  technique  discussed  here  will  not  offer  any  Image  quality 
Improvement.  (In  fairness,  It  Is  possible  to  conjure  up  circumstances 
which  will  confound  the  deterministic  approach  also.)  However,  there 
are  many  circumstances  in  which  the  effect  of  the  phase  errors  Is  to 
degrade  rather  than  to  destroy  the  Image.  In  these  cases,  the  fact  that 
SAR  Images  can  be  formed  even  though  the  data  may  be  corrupted  by  phase 
errors,  arises  from  dependence  between  the  {en}.  Thus,  as  successive 
echoes  are  received  we  can  take  advantage  of  this  correlation.  Note 
that  this  Is  not  equivalent  to  eliminating  the  factor  exp (Jen)  In  (1-8) 
by  taking  envelopes.  Nor  Is  It  true  that  only  the  envelope  Is 
observable.  The  set  of  observables  available  to  us  Is  the  collection  of 
complex  values 

Q(«. Mi) ,  3(«.u2),  ...  fj(w,un),  ...  <3(w,un) 

from  N  received  echoes,  preserving  their  Individual  phases,  even  though 
they  contain  the  errors.  (Altermiilvely  we  could  work  directly  with  the 
data  In  the  time  domain.) 

It  turns  out  that  relative  phase  Is  the  only  thing  that  matters  In 
forming  the  coherent  sum  of  the  N  samples,  Hence,  for  N  echoes,  there 
are  (N-l)  unknown  phases.  There  are  several  ways  of  approaching  the 
problem  of  estimating  this  (N-l) -dimensional  phase  vector,  but  they  all 
rely  on  employing  some  kind  of  structural  constraint  on  the  variation  of 
en  from  one  echo  to  the  next,  as  they  should,  because  without  such  a 
structure  we  have  already  stated  that  exhaustive  search  is  all  that  Is 
left. 


STOCHASTIC  APPROACH 


One  approach  Is  to  regard  the  en  as  samples  of  an  N-dlmenslonal 
stochastic  process  that  characterizes  the  phase  errors  over  the  time 
history  Involved  in  the  coherent  Integration  process.  For  example, 
suppose  the  collection  of  Q  observables  occurring  over  N  echoes  can  be 
modelled  as  samples  from  a  Gaussian  random  process.  Then  the  N  echoes 
can  be  completely  described  rtatlstlcally  using  the  mean  and  covariance 
function  of  the  underlying  process. 

For  the  moment  assume  that  the  mean  Is  known.  Then  It  Is  no  problem 
If  we  also  assume  that  the  mean  Is  zero.  In  this  case  all  Information 
Is  embodied  In  the  covariance  function  of  the  underlying  process.  The 
random  process  we  are  speaking  of  generates  an  N-dlmenslonal  Joint 
Gaussian  probability  density  function  which  requires  specifying  the 
N  X  N  covariance  matrix  of  the  {en),  [actually  of  exp(jen)].  This 
correlation  matrix  results  from  sampling  a  continuous  process,  and  we 
can  Imagine  that  there  Is  an  underlying  covariance  function 

/>( t,s)  -  <*xp(jet)  exp(-jes)>  ■  <exp[j(et  -  e$)]>  .  (1-9) 

In  general  />(t,$)  Is  complex  and  It  may  be  separated  Into  Its  magnitude 
and  phase 

/j(t, $)  «  I /) (t , s)  I  eJ*(t’s)  .  (1-10) 

If  the  observables  can  be  modelled  as  jointly  Gaussian  (1 . e. ,  as 
samples  from  a  Gaussian  random  process),  then  It  Is  possible  to  perform 
a  series  expansion  for  any  sample  function  and  the  resulting  expansion 
Is  analagous  to  the  representation  of  a  determini stlc  time  waveform— but 

. 

1  there  are  some  Important  differences. 

i 

I 
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Suppose  we  consider  the  problem  of  representing  a  one-dimensional 
function,  v(t)t  over  an  Interval  [-T/2,  T/2] -  If  v (t)  Is  a 

deterministic  waveform,  It  has  a  Fourier  series  representation: 

vN(t)  -  ^  vk  exp(jwkt)  ;  wk  -  ^  (I-lla) 

where 

T/2  T/2 

vk  ■  J  v (t)  exp(-jwkt)  dt  ■  j  I v (t)  I  e^®^  exp(-jwkt)  dt  . 

-T/2  -T/2  (I-llb) 

Note  that  In  general  the  coefficients,  v|<,  are  complex  and  they  are 
functions  of  the  phase  function,  a(t),  of  v(t).  Thus,  we  should  more 
carefully  write  v^ (t)  -  Vfj(t;a(t)).  The  property  of  the  expansion  In 
(1-11)  Is  that  the  series  converges  everywhere  to  v(t). 

11m  lvN(t)  a(t))  -v(t)l2  -  0  (1-12) 

N-»« 

If  the  phase,  a(t),  In  (I-llb)  Is  unknown,  then  we  cannot  form  the 
series  expansion  unless  some  estimate.  a(t),  of  a(t)  Is  available. 

Leaving  aside  for  the  moment  just  how  a(t)  can  be  formulated,  Its 

utility  Is  based  on  the  requirement  [for  the  deterministic  v(t)]  that 

11m  I vM(t;  a(t))  -  v (t) 1 2  ■  0  .  (1-13) 

N-»«  " 

For  brevity  we'll  denote  v^(t;a(t))  s  v^(t).  From  (I-llb)  and 
(1-13)  It  Is  seen  that  for  the  deterministic  case,  the  problem  of 
representing  v(t)  accurately  Is  equivalent  to  knowing  I v ( t) I  and  a(t). 
In  this  case,  there  Is  no  problem  to  solve  (except  to  take  more  terms  In 


the  series)  If  we  assume  that  both  v (t)  and  a(t)  are  known.  The 
deterministic  phase  retrieval  problem  consists  of  assuming  that  a(t)  Is 
unknown  and  searching  for  conditions  under  which  a(t)  can  be  uniquely 
Inferred  from  I v (t) I .  In  the  one-dimensional  deterministic  signal  case 
there  are  no  generally  useful  conditions  under  which  this  can  be  done. 
To  overcome  the  lack  of  constraints  between  a(t)  and  I v(t) I  in  the  one- 
dimensional  deterministic  case,  one  possibility  Is  to  Invoke 
relationships  between  a(t)  and  I v(t) I  that  arise  from  exploiting  object 
shape  Information.  When  such  Information  Is  at  hand  It  Is  certainly 
worth  exploiting.  Another  possible  approach  Is  to  regard  a(t)  as  a 
known  function  of  a  finite  (and  small)  number  of  unknown  deterministic 
parameters.  A  third  alternative,  and  the  one  explored  below,  exploits  a 
different  source  of  auxiliary  Information.  So  let's  see  what  happens  If 
we  regard  the  function  v(t)  as  a  stochastic  process. 

When  v (t)  Is  a  sample  function  from  a  Gaussian  random  process,  It 
turns  out  that  there  Is  an  expansion  [Karhunen-Loeve  expansion]  for  v(t) 
that  corresponds  to  the  Fourier  series  In  (1-11).  The  desired  expansion 
is  described  as  follows: 

N 

vN(t)  »  Z  vk  Mt)  (I-14a) 

k-o  *  * 

where 

172 

vk  -  /  v(t)  *J(t)  dt  ( I- 14b) 

-T/2 

and  where  (♦k(t)}"k«l  *s  set  eigenfunctions  with  eigenvalues 
{Xk}*"k-1  that  satisfy  the  following  Integral  equation  (the 
eigenfunctions  are  ortho-normal): 
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(I-14c) 


T/2 

xk  *k^  "  /  />(*•*)  Vs)  ds  * 

-T/2 

Note  that  the  {^(t)}  are  completely  determined  from  knowledge  of 
p( t,s). 

It  can  be  shown  that  In  the  special  case  of  "stationarlty"  where 
p( t,s)  ■  p(t-s)  and  where  T  is  large  compared  to  the  reciprocal 
bandwidth  of  the  power  spectrum.  S^w)  -  ^’{^(t.s)},  the  solution  to 
(I-14c)  Is  the  set  of  functions  #|<(t)  ■  exp(J«|(t)  and  the  X|<  are  simply 
the  values  of  S ^(w|<).  Thus,  In  this  case  of  stationarlty  and  large 

observation  interval,  the  expressions  In  (I-lla,  b)  and  (I-14a,  b)  are 
algebraically  identical.  Nonetheless  the  interpretation  of  the  two 
results  Is  different  and  (I-14a,  b)  Is  more  general.  For  one  thing,  In 
the  stochastic  case,  the  most  we  can  hope  for  Is  that 

<1 1m  lvN(t)  -v(t)l2>  *  0  (1-15) 

where  <...>  denotes  expectation  (averaging)  over  the  ensemble  of 
functions  generated  by  the  underlying  random  process.  Moreover,  (1-15) 
Is  true  only  If  the  function  p( t,s)  In  (I-14c)  Is  positive  definite. 
(This  means  It  has  no  zero  eigenvalues).  This  Is  not  a  difficult 
requirement  to  satisfy  in  practice.  For  example,  the  result  of  passing 
white  noise  through  any  linear,  stable,  realizable  filter  will  yield  a 
random  process  whose  covariance  function  Is  strictly  positive  definite. 

It  should  be  pointed  out  that  the  convergence  of  v^(t)  to  v(t)  in 
(1-15)  Is  In  general  weaker  than  the  convergence  In  (1-12).  However,  If 
v (t)  Isa  sample  function  from  a  Gaussian  random  process,  then  the  two 
are  essentially  the  same  (except  on  a  set  of  measure  zero!). 
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Finally,  we  hasten  to  add  that  the  expansion  In  (1-14)  does  not 
require  statlonarlty,  so  It  Is  a  more  general  treatment  than  Fourier 
series  alone  would  permit.  Since  the  Fourier  series  approaches  the 
Fourier  transform  as  T  *  •  (In  the  deterministic  case),  we  can  expect 
conditions  we  may  be  looking  for  In  the  deterministic  case  may  have  no 
counterpart  In  the  stochastic  case  (unless  the  random  process  Is 
stationary,  of  course). 

1.3  SUMMARY 

We  now  know  that  to  represent  a  Gaussian  random  process  uniquely 
over  [-T/2,  T/2]  we  need  (assuming  zero  mean) 

(1)  p{ t,s)  -  <v(t)v*(s)> 

(2)  Positive  definiteness  of  ^(t,s) 

(3)  Any  sample  function,  v(t),  observed  over  [-T/2,  T/2], 

Thus,  If  />(t,s)  Is  completely  known,  there  Is  no  reconstruction  problem 
remaining,  and  this  Is  true  even  If  the  sample  function,  v(t),  contains 
a  phase  function  that  Is  perturbed  by  a  random  unknown  component. 
Whereas  In  the  deterministic  case  the  phase  of  the  underlying  sample  had 
to  be  considered  unknown  to  still  have  a  problem  to  solve,  In  the  random 
process  formulation  a  problem  of  Interest  occurs  as  long  as  the 
covariance  function  Is  regarded  as  Incompletely  specified.  This  Is  an 
Important  conceptual  difference  between  the  two  problems. 

The  physical  significance  of  the  above  Is  that  the  deterministic 
situation  seeks  constraints  on  the  exact  values  (for  uniqueness)  that  a 
particular  time  waveform  can  have,  whereas  In  the  random  model  the 
constraints  Involve  the  behavior  (correlation)  between  different  sample 
functions,  drawn  from  a  statistically  similar  ensemble. 
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The  stochastic  version  of  phase  retrieval 

Based  on  the  above  we  may  consider  the  following  non-trlvial  phase 
retrieval  problem  for  one-dimensional  data.  (The  notions  apply  equally 
well  for  two-dimensional  data,  but  unlike  the  deterministic  approach,  we 
do  not  encounter  an  unsolvable  one-dimensional  problem  so  we  Illustrate 
the  Ideas  only  for  that  case  now.) 

Let  v (t)  ■  aUJi-M*)  be  a  sample  function  of  a  random  process 
observed  over  [-T/2,  T/2].  Assume  <v(t)>  -  0  and 

,[0(t)  -  0(i)] 

p( t.s)  »  <*<t)  *(*)>  -  <a(t)  a*(s)>  <eJ  >  (1-16) 

where  the  "envelope"  and  "phase"  random  processes  a(t)  and  6[ t),  have 
respectively  been  assumed  to  be  mutually  Independent. 

Hence, 


/>( t.s)  «  p&( t, s)  ^(t,s)  (1-17) 

where  />a  Is  the  covariance  of  the  amplitude  and  pg  Is  the  covariance  of 
the  phase. 

From  (1-14),  If  p a  and  pg  were  known,  we  could  (In  principle)  solve 
(I-14c)  for  (#|<(t)}  and  {X|<}  and  then  obtain  our  desired  expansion.  We 
can,  however,  regard  pg  as  Incompletely  known,  and  consider  the 
estimation  of  this  function  as  the  phase-retrieval  problem. 

Estimating  oa 

Without  some  kind  of  parameterization  of  the  problem  It  Is  hopeless 
to  attempt  to  obtain  a  useful  estimate  of  pg  from  a  limited  quantity  of 
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data.  However,  In  many  cases  It  Is  reasonable  to  model  0(t)  as  the 
result  of  certain  dynamic  processes  that  must  satisfy  laws  of  motion. 

For  example,  If  0(t)  Is  the  residual  phase  error  resulting  from 
uncompensated  target/platform  motion  effects  then 

^(t)  -  MM  -  {r(tQ)  +  r(t0)  (t  -  tQ)  +  \  r(tQ)  (t  -  tQ)2} 

(1-18) 

where  r(t0),  r(t0),  r(t0)  are  random  variables,  fixed  during  one  record. 
Then  estimating  pg  Is  equivalent  to  estimating  these  constants  based  on 
the  data  collected  over  an  Interval  [-T/2,  T/2],  This  can  be 
Interpreted,  In  some  cases,  as  equivalent  to  estimating  certain 
parameters  of  the  observed  processes'  power  spectrum,  such  as  Its 
bandwidth.  It  Is  possible  to  employ  the  rigors  of  statistical 
estimation  theory  to  derive  suitable  estimates  of  the  required 
parameters  of  fl(t)  using  the  corrupted  observable  data,  and  It  Is  also 
possible  to  derive  expressions  for  the  accuracy  attainable  In  estimating 
P9- 

The  theory  required  to  develop  the  required  estimates  exists  and 
does  not  require  any  conceptual  or  mathematical  breakthroughs— but  It  1_s 
different  from  the  deterministic  phase  retrieval  problem. 

It  has  been  demonstrated  that  the  problem  of  estimating  the  phase  In 
a  phase-corrupted  version  of  received  radar  data  can  be  viewed  as  a 
problem  of  estimating  a  portion  of  the  received  data's  correlation 
function.  This  viewpoint  Is  capable  of  producing  useful  phase  estimates 
even  on  one-dimensional  (time-series)  data  without  requiring  any 
Information  on  target  shape.  Instead,  a  priori  Information  about  the 
physics  of  the  dynamic  processes  that  cause  the  undesired  phase 
disturbance  Is  exploited.  The  Ideas  suggested  here  are  therefore 


adjuncts  to  the  deterministic  phase  retrieval  methods  currently  being 
pursued.  Use  of  both  approaches  should  further  enhance  phase  retrieval 
performance. 
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